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PRODUCTION OP HYDROXYIATED PATTY ACIDS 
IN GENETICALLY MODIFIED PLANTS 



TECHNICAL FIELD 

The present invention concerns the identification of 
nucleic acid sequences and constructs, and methods related 
thereto, and the use of these sequences and constructs to 
produce genetically modified plants for the purpose of 
altering the fatty acid composition of plant oils, waxes and 
related compounds. 

DEFINITIONS 

The subject of this invention is a class of enzymes that 
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introduce a hydroxyl group into several different fatty acids 
resulting in the production of several different kinds of 
hydroxy iated fatty acids- In particular, these enzymes 
catalyze hydroxylation of oleic acid to 12-hydroxy oleic acid 
and icosenoic acid to 14-hydroxy icosenoic acid. Other fatty 
acids such as palmitoleic and erucic acids may also be 
substrates. Since it is not possible to refer to the enzyme 
by reference to a unique substrate or product, we refer to the 
enzyme throughout as kappa hydroxylase to indicate that the 
enzyme introduces the hydroxyl three carbons distal (i.e., 
away from the carboxyl carbon of the acyl chain) from a double 
bond located near the center of the acyl chain. 

The following fatty acids are also the subject of this 
invention: ricinoleic acid, 12-hydroxyoctadec-cis-9-enoic acid 
( 120H-18 : i c,sA9 ) ; lesquerolic acid, 14-hydroxy-cis-ll-icosenoic 
acid ( 14OH-20: l c,s * 11 ) ; densipolic acid, 12-hydroxyoctadec-cis- 
9,15-dienoic acid (120H-18 : 2 cl<ai9 * 15 ) ; auricolic acid, 14- 
hydroxy-cis-11, 17-icosadienoic acid ( 14OH-20 : 2 eisAn ' 17 ) ; 
hydroxyerucic, l6-hydroxydocos-cis-l3-enoic acid (160H- 
22:1 C,,A13 ); hydroxypalmit oleic , l2-hydroxyhexadec-cis-9-enoic 
(120H-16:i ci8A9 ) ; icosenoic acid (20: l c,sA11 ) . It will be noted 
that icosenoic acid is spelled eicosenoic acid in some 
countries. 

BACKGROUND 

Extensive surveys of the fatty acid composition of seed 
oils from different species of higher plants have resulted in 
the identification of at least 33 structurally distinct 
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monohydroxylated plant fatty acids, and 12 different 
polyhydroxylated fatty acids that are accumulated by one or 
more plant species (reviewed by van de Loo et al. 1993). 
Ricinoleic acid, the principal constituent of the s^ed oil 
5 from the castor plant Riclnus communis (L.) , is of commercial 

importance. We have previously described the cloning of a 
gene from this species that encodes a fatty acid hydroxylase, 
and the use of this gene to produce ricinoleic acid in 
transgenic plants of other species. The scientific 

10 evidence supporting this claim was published in 

1995 . The use of the castor hydroxylase gene to also produc 
other hydroxylated fatty acids such as lesquerolic acid, 

15 densipolic acid, hydroxypalmitoleic , hydroxyerucic and 

auricolic acid in transgenic plants is the subject of this 
invention. In addition, the identification of a gene encoding 
a homologous hydroxylase from Lesquerella fendleri , and the 
use of this gene to produce these hydroxylated fatty acids in 

20 transgenic plants is the subject of this invention. 

Castor is a minor oilseed crop. Approximately 50% of the 
seed weight is oil (triacylglycerol) in which 85-90% of total 
fatty acids are the hydroxylated fatty acid, ricinoleic acid. 
Oil pressed or extracted from castor seeds has many industrial 

25 uses based upon the properties endowed by the hydroxylated 

fatty acid. The most important uses are production of paints 
and varnishes, nylon-type synthetic polymers, resins, 
lubricants, and cosmetics (Atsmon 1989). In addition to oil, 
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'the castor seed contains the extremely toxic protein ricin, 
allergenic pr teins, and the alkaloid ricinine. These 
constituents preclude the use of the untreated seed meal 
(following oil extraction) as a livestock feed, normally an 
5 important economic aspect of oilseed utilization. 

Furthermore, with the variable nature of castor plants and a 
lack of investment in breeding, castor has few favorable 
agronomic characteristics. For a combination of these 
reasons, castor is no longer grown in the United States and 

10 the development of an alternative domestic source of 

hydroxy lated fatty acids would be attractive. The production 
of ricinoleic acid, the important constituent of castor oil, 
in an established oilseed crop through genetic engineering 
would be a particularly effective means of creating a domestic 

15 source. 

Because there is no practical source of lesquerolic, 
densipolic and auricolic acids from plants that are adapted to 
modern agricultural practices, there is currently no large- 
scale use of these fatty acids by industry. However, the 

20 fatty acids would have uses similar to those of ricinoleic 

acid if they could be produced in large quantities at 
comparable cost to other plant-derived fatty acids (Smith 
1985) . Plant species, such as certain species in the genus 
Lesquerella, that accumulate a high proportion of these fatty 

25 acids, have not been domesticated and are not currently 

considered a practical source of fatty acids (Hirsinger, 
1989) . This invention represents a useful step toward the 
eventual production of these and other hydroxy lated fatty 
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acids in transgenic plants of agricultural importance * 

The taxonomic relationships between plants having similar 
or identical kinds of unusual fatty acids have been examined 
(van de Loo et al., 1993). In some cases, particular fatty 
5 acids occur mostly or solely in related taxa. In other cases 

there does not appear to be a direct link between taxonomic 
relationships and the occurrence of unusual fatty acids. In 
this respect, ricinoleic acid has now been identified in 12 
genera from 10 families (reviewed in van de Loo et al. , 1993) . 

10 Thus, it appears that the ability to synthesize hydroxylated 

fatty acids has evolved several times independently during the 
radiation of the angiosperms. This suggested to us that the 
enzymes which introduce hydroxyl groups into fatty acids arose 
by minor modifications of a related enzyme. Indeed, as shown 

15 herein, the sequence similarity between A 12 fatty acid 

desaturases and the kappa hydroxylase from castor is so high 
that it is not possible to unambiguously determine whether a 
particular enzyme is a desaturase or a hydroxylase on the 
basis of evidence in the scientific literature. Similarly, a 

20 patent application {wo (9 4/11516) that purports to teach the 

isolation and use of A12 fatty acid desaturases does not teach 
how to distinguish a hydroxylase from a desaturase. In view 
of the importance of being able to distinguish between these 
activities for the purpose of genetic engineering of plant 

25 oils, the utility of that application is limited to the 

several instances where direct experimental evidence (e.g., 
altered fatty acid composition in transgenic plants) was 
presented to support the assignment of function. A method for 
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distinguishing between fatty acid desaturases and fatty acid 
hydroxylases on the basis of amino acid sequence of the enzyme 
is also a subject of this invention. 

A feature of hydroxylated or other unusual fatty acids is 
5 that they are generally confined to seed triacylglycerols, 

being largely excluded from the polar lipids by unknown 
mechanisms (Battey and Ohlrogge 1989; Prasad et al., 1987). 
This is particularly intriguing since diacylglycerol is a 
precursor of both triacylglycerol and polar lipid. With 

10 castor microsomes, there is some evidence that the pool of 

ricinoleoyl-containing polar lipid is minimized by a 
preference of diacylglycerol acyltransf erase for ricinoleate- 
containing diacylglycerols (Bafor et al. 1991) . Analyses of 
vegetative tissues have generated few reports of unusual fatty 

15 acids, other than those occurring in the cuticle. The cuticle 

contains various hydroxylated fatty acids which are 
interesterif ied to produce a high molecular weight polyester 
which serves a structural role. A small number of other 
exceptions exist in which unusual fatty acids are found in 

20 tissues other than the seed. 

The biosynthesis of ricinoleic acid from oleic acid in 
the developing endosperm of castor {Rlcinus communis) has been 
studied by a variety of methods. Morris (1967) established in 
double- labeling studies that hydroxy lation occurs directly by 

25 hydroxy 1 substitution rather than via an unsaturated-, keto- 

or epoxy- intermediate . Hydroxylation using oleoyl-CoA as 
precursor can be demonstrated in crude preparations or 
microsomes, but activity in microsomes is unstable and 

- 6 - 
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variable, and is lation of the microsomes involved a 
considerable, or sometimes complete loss of activity (Galliard 
and Stumpf, 1966; Moreau and Stumpf, 1981). Oleic acid can 
replace oleoyl-CoA as a precursor, but only in the presence of 
CoA, Mg 2 * and ATP (Galliard and Stumpf, 1966) indicating that 
activation to the acyl-CoA is necessary. However, no 
radioactivity could be detected in ricinoleoyl-CoA (Moreau and 
Stumpf, 1981) . These and more recent observations (Bafor et 
al., 1991) have been interpreted as evidence that the 
substrate for the castor oleate hydroxylase is oleic acid 
esterified to phosphatidylcholine or another phospholipid. 

The hydroxylase is sensitive to cyanide and azide, and 
dialysis against metal chelators reduces activity, which could 
be restored by addition of FeS0 4 , suggesting iron involvement 
in enzyme activity (Galliard and Stumpf, 1966) . Ricinoleic 
acid synthesis requires molecular oxygen (Galliard and Stumpf, 
1966; Moreau and Stumpf 1981) and requires NAD ( P ) H to reduce 
cytochrome b5 which is thought to be the intermediate electron 
donor for the hydroxylase reaction (Smith et al., 1992). 
Carbon monoxide does not inhibit hydroxy lation, indicating 
that a cytochrome P4 50 is not involved (Galliard and Stumpf, 
1966; Moreau and Stumpf 1981) . Data from a study of the 
substrate specificity of the hydroxylase show that all 
substrate parameters (i.e., chain length and double bond 
position with respect to both ends) are important; deviations 
in these parameters caused reduced activity relative to oleic 
acid (Howling et al., 1972). The position at which the 
hydroxy 1 was introduced, however, was determined by the 

- 7 - 
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' position of the double bond, always being three carbons 
distal. Thus, the castor acyl hydroxylase enzyme can produce 
a family of different hydroxy lated fatty acids depending on 
the availability of substrates. Thus, as a matter of 
5 convenience, we refer to the enzyme throughout as a kappa 

hydroxylase (rather than an oleate hydroxylase) to indicate 
the broad substrate specificity. 

The castor kappa hydroxylase has many superficial 
similarities to the microsomal fatty acyl desaturases (Browse 

10 and Somerville, 1991) . In particular, plants have a 

microsomal oleate desaturase active at the Al2 position* The 
substrate of this enzyme (Schmidt et al. ( 1993) and of the 
hydroxylase (Bafor et al., 1991) appears to be a* fatty acid 
esterified to the sn-2 position of phosphatidylcholine. When 

15 oleate is the substrate, the modification occurs at the same 

position (Al2) in the carbon chain, and requires the same 
cof actors, namely electrons from NADH via cytochrome b 5 and 
molecular oxygen. Neither enzyme is inhibited by carbon 
monoxide (Moreau and Stumpf , 1981) , the characteristic 

20 inhibitor of cytochrome P450 enzymes. 

There do not appear to have been any published 
biochemical studies of the properties of the hydroxylase 
enzyme (s) in Lesguerella. 



25 Conceptu al basis of the invention 

An earlier application has described 
the use of a cDNA clone from castor for the production of 
ricinoleic acid in transgenic plants. As noted above, 

- 8 - 
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biochemical studies by others had suggested that the castor 
hydroxylase may not have strict specificity for oleic acid but 
would also catalyze hydroxy lation of other fatty acids such as 
icosenoic acid (20:l ciwW1 ) (Howling et al., 1972). Based on 
5 these studies, 

the expression of the castor hydroxylase in 
transgenic plants of species such as Brasslca napus and 
Arabidopsis thai i ana that accumulate fatty acids such as 
icosenoic acid (20:l cisAn ) and erucic acid (13-docosenoic acid; 

10 22:i cifi&13 ) would be expected to accumulate some of the 

hydroxylated derivatives of these fatty acids due to the 
activity of the hydroxylase on these fatty acids. We have now 
obtained additional direct evidence for such a claim based on 
the production of ricinoleic, lesquerolic, densipolic and 

15 auricolic fatty acids in transgenic Arabidopsis plants and 

have included such evidence herein as Example l. 

We have previously disclosed 
the various methods by which the castor hydroxylase clone and 
sequences derived thereof could be used to identify other 

20 hydroxylase clones from plant species such as Lesqverella 

fendleri that are known to accumulate hydroxylated fatty acids 
in seed oils. In this continuation we have provided an 
example of the use of that aspect of the invention for the 
isolation of a novel hydroxylase gene from Lesquerella 

25 fendleri. 

In view of the high degree of sequence similarity between 
Al2 fatty acid desaturases and the castor hydroxylase (van de 
Loo et al., 1995); the validity of claims for the use of 

- 9 - 
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' desaturase or hydroxylase genes or sequences derived therefrom 
for the identification f genes of identical function from 
other speci s must be viewed with skepticism. In this 
application, we teach a method by which hydroxylase genes can 
5 be distinguished from desaturases and describe methods by 

which A 12 desaturases can be converted to hydroxylases by the 
modification of the gene encoding the desaturases. A 
mechanistic basis for the similar reaction mechanisms of 
desaturases and hydroxylases was presented in an earlier 

10 patent application. Briefly, the available 

evidence suggests that fatty acid desaturases have a similar 
reaction mechanism to the bacterial enzyme methane 
monooxygenase which catalyses a reaction involving oxygen-atom 
transfer (CH 6 - CH 3 OH) (van de Loo et al. , 1993). The cofactor 

15 in the hydroxylase component of methane monooxygenase is 

termed a #i-oxo bridged diiron cluster (FeOFe) . The two iron 
atoms of the FeOFe cluster are liganded by protein-derived 
nitrogen or oxygen atoms, and are tightly redox-coupled by the 
covalently-bridging oxygen atom. The FeOFe cluster accepts 

20 two electrons, reducing it to the diferrous state, before 

oxygen binding. Upon oxygen binding, it is likely that 
heterolytic cleavage also occurs, leading to a high valent 
oxoiron reactive species that is stabilized by resonance 
rearrangements possible within the tightly coupled FeOFe 

2 5 cluster. The stabilized high-valent oxoiron state of methane 

monooxygenase is capable of proton extraction from methane, 
followed by oxygen transfer, giving methanol. The FeOFe 
cofactor has been shown to be directly relevant to plant fatty 

- 10 - 



6NSDOCID: <WO .9610075A1_I_> 



WO 96/10075 PCT/US95/1 1855 

acid modifications by the demonstration that castor stearoyl- 
ACP desaturase contains this type of cof actor (Fox et al. , 
1993). 

On the basis of the foregoing considerations, we 
5 hypothesized that the castor oleate hydroxylase is a 

structurally modified fatty acyl desaturase, based upon three 
arguments. The first argument involves the taxonomic 
distribution of plants containing ricinoleic acid. Ricinoleic 
acid has been found in 12 genera of 10 families of higher 

10 plants (reviewed in van de Loo et al., 1993). Thus, plants in 
which ricinoleic acid occurs are found throughout the plant 
kingdom, yet close relatives of these plants do not contain 
the unusual fatty acid. This pattern suggests that the 
ability to synthesize ricinoleic acid has arisen (and been 

15 lost) several times independently, and is therefore a quite 

recent divergence. In other words, the ability to synthesize 
ricinoleic acid has evolved rapidly, suggesting that a 
relatively minor genetic change in the structure of the 
ancestral enzyme was necessary to accomplish it. 

20 The second argument is that many biochemical properties 

of castor kappa hydroxylase are similar to those of the 
microsomal desaturases, as discussed above (e.g., both 
preferentially act on fatty acids esterified to the sn-2 
position of phosphatidylcholine, both use cytochrome b5 as an 

25 intermediate electron donor, both are inhibited by cyanide, 

both require molecular oxygen as a substrate, both are thought 
to be located in the endoplasmic reticulum) . 

The third argument stems from the discuss i n of oxygenase 

- 11 - 
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cof actors above, in which it is suggested that the plant 
membrane bound fatty acid desaturases may have a ji-oxo bridged 
diiron cluster-type cof actor, and that such cof actors are 
capable of catalyzing both fatty acid desaturations and 
hydroxylations , depending upon the electronic and structural 
properties of the protein active site. 

Taking these three arguments together, it was 
hypothesized that kappa hydroxylase of castor endosperm is 
homologous to the microsomal oleate A 12 desaturase found in 
all plants. The evidence supporting this hypothesis h*as been 
disclosed. 

A number of genes encoding microsomal A12 desaturases from 
various species have recently been cloned (Okuley et al. , 
1994) and substantial information about the structure of these 
enzymes is now known. Hence, in the following invention we 
teach how to use structural information about fatty acyl 
desaturases to isolate kappa hydroxylase genes of this 
invention. This example teaches the method by which any 
carbon-monoxide insensitive plant fatty acyl hydroxylase gene 
can be identified by one skilled in the art. 



BRIEF DESCRIPTION OF THE DRAWINGS 



Figures 1A-D show the mass spectra of hydroxy fatty acids 
standards (Figure 1A, O-TMS-methylricinoleate; Figure IB, O- 
TMS-methyl densipoleate ; Figure 1C, O-TMS-methyl- 
lesqueroleate; and Figure ID, O-TMS-methylauricoleate . 

Figure 2 shows the fragmentation pattern of 

- 12 - 
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trim thylsilylated m thyl esters of hydroxy fatty acids. 

Figure 3A shows the gas chroma togr am of fatty acids 
extracted from seeds of wild type Arabidopsis plants. Figure 
3B shows the gas chromatogram of fatty acids extracted from 
5 seeds of transgenic Arabidopsis plants containing the fahl2 

hydroxylase gene. The numbers indicate the following fatty 
acids: [1] 16:0; [2] 18:0; [3] i8:lcisA9; [4] 18 : 2 cifiA9 ' 12 ; [5] 
20:0; [6] 20:l cUa "; [ 7] 18 : 3 cis * 9 ' 12 ' 1 V [8 ] 22 : l ci ** 13 ; [9] 24:l c ^ 13 ; 
[I0]ricinoleic acid; [11] densipolic acid; [12] lesquerolic 
10 acid; [13] auricolic acid. 

Figures 4A-D show the mass spectra of novel fatty acids 
found in seeds of transgenic plants. Figure 4A shows the mass 
spectrum of peak 10 from Figure 3B. Figure 4B shows the mass 
spectrum of peak 11 from Figure 3B. Figure 4C shows the mass 
15 spectrum of peak 12 from Figure 3B. Figure 4D shows the mass 

spectrum of peak 13 from Figure 3B. 

Figure 5 shows the nucleotide sequence of pLesq2 (SEQ ID 
NO:l) . 

Figure 6 shows the nucleotide sequence of pLesq3 (SEQ ID 
20 NO:2). 

Figure 7 shows a Northern blot of total RNA from seeds of 
L . fendlerl probed with pLesq2 or pLesq3 . S, indicates RNA is 
from seeds; L, indicates RNA is from leaves. 

Figures 8A-B show the nucleotide sequence of genomic 
2 5 clone encoding pLesq-HYD (SEQ ID NO: 3), and the deduced amino 

acid sequence of hydroxylase enzyme encoded by the gene (SEQ 
ID NO: 4) . 

Figures 9A-B show multiple sequence alignment of deduced 

- 13 - 
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amino acid s guences for kappa hydroxylas s and microsomal A12 
desaturases. Abbreviations are: Rcfahl2, fahl2 hydroxylase 
gene from R. communis (van de Loo et al., 1995); Lffahl2, 
kappa hydroxylase gene from L. fendleri; Atfad2, faa2 
5 desaturase from Arabidopsis thaliana (Okuley et al . , 1994) ; 

Gmfad2-1, fad2 desaturase from Glycine max (GenBank accession 
number L43920) ; Gmfad2-2, fad2 desaturase from Glycine max 
(Genbank accession number L43921) ; Zmf ad2 , fad2 desaturase 
from Zea mays (wo 94/11516 ); Rcfad2, fragment of fad2 

10 desaturase from R. communis ( wo 9 4/11516 ; Bnf ad2 , fad2 

desaturase from Brassica napus ( WO/ 94/11516 ) ; LFFAH12 . AMI , 
SEQ ID NO: 4; FAH 1 2 . AMI , SEQ ID NO: 5; ATFAD2 . AMI , SEQ ID NO: 6; 
BNFAD2 .AMI , SEQ ID NO:7; GMFAD2 - 1 . AMI , SEQ ID NO:8; GMFAD2- 
2. AMI, SEQ ID NO:9; ZMFAD2 . AMI , SEQ ID NO:10; and RCFAD2 . AMI , 

15 SEQ ID NO: 11. 

Figure 10 shows a Southern blot of genomic DNA from L . 
fendleri probed with pLesq-HYD. E=EcoRI, H = Hindlll, x = 
Xbal. 

Figure 11 shows a map of binary Ti plasmid pSLJ44024. 

20 

SUMMARY O F THE INVENTION 

This invention relates to plant fatty acyl hydroxylases. 
Methods to use conserved amino acid or nucleotide sequences to 
25 obtain plant fatty acyl hydroxylases are described. Also 

described is the use of cDNA clones encoding a plant 
hydroxylase to produce a family of hydroxylated fatty acids in 

- 14 - 
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transgenic plants. 

In a first embodiment, this invention is directed to 
recombinant DNA constructs which can provide for the 
transcription or transcription and translation (expression) of 
5 the plant kappa hydroxylase sequence. In particular, 

constructs which are capable of transcription or transcription 
and translation in plant host cells are preferred. Such 
constructs may contain a variety of regulatory regions 
including transcriptional initiation regions obtained from 

10 genes preferentially expressed in plant seed tissue. In a 

second aspect, this invention relates to the presence of such 
constructs in host cells, especially plant host cells which 
have an expressed plant kappa hydroxylase therein. 

In yet another aspect, this invention relates to a method 

15 for producing a plant kappa hydroxylase in a host cell or 

progeny thereof via the expression of a construct in the cell. 
Cells containing a plant kappa hydroxylase as a result of the 
production of the plant kappa hydroxylase encoding sequenc 
are also contemplated herein. 

20 in another embodiment, this invention relates to meth ds 

of using a DNA sequence encoding a plant kappa hydroxylase for 
the modification of the proportion of hydroxylated fatty acids 
produced within a cell, especially plant cells. Plant cells 
having such a modified hydroxylated fatty acid composition are 

25 also contemplated herein. 

In a further aspect of this invention, plant kappa 
hydroxylase proteins and sequences which are related thereto, 
including amino acid and nucleic acid sequences, are 

- 15 - 
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contemplated. Plant kappa hydroxylase exemplified herein 
includes a Lesquerella fendleri fatty acid hydroxylas . This 
exemplified fatty acid hydroxylase may be used to obtain other 
plant fatty acid hydroxylases of this invention. 

In a further aspect of this invention, a nucleic acid 
sequence which directs the seed specific expression of an 
associated polypeptide coding sequence is described. The use 
of this nucleic acid sequence or fragments derived thereof, to 
obtain seed-specific expression in higher plants of any coding 
sequence is contemplated herein. 

DETAILED DESCRIPTION OF THE INVENTION 

A genetically transformed plant of the present invention 
which accumulates hydroxy lated fatty acids can be obtained by 
expressing the double-stranded DNA molecules described in this 
application. 

A plant fatty acid hydroxylase of this invention includes 
any sequence of amino acids, such as a protein, polypeptide or 
peptide fragment, or nucleic acid sequences encoding such 
polypeptides, obtainable from a plant source which 
demonstrates the ability to catalyze the production of 
ricinoleic, lesquerolic, hydroxyerucic ( 16-hydroxydocos-cis- 
13-enoic acid) or hydroxypalmitoleic ( 12-hydroxyhexadec-cis-9- 
enoic) from CoA, ACP or lipid-linked monoenoic fatty acid 
substrates under plant enzyme reactive conditions. By *' nzyme 
reactive conditions" is meant that any necessary conditi ns 
are available in an environment (i.e., such factors as 
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temperature, pH, lack of inhibiting substances) which will 
permit the enzyme to function. 

Preferential activity of a plant fatty acid hydroxylase 
toward a particular fatty acyl substrate is determined upon 
5 comparison of hydroxylated fatty acid product amounts obtained 

per different fatty acyl substrates. For example, by "oleate 
preferring" is meant that the hydroxylase activity of the 
enzyme preparation demonstrates a preference for oleate- 
containing substrates over other substrates. Although the 

10 precise substrate of the castor fatty acid hydroxylase is not 

known, it is thought to be a monounsaturated fatty acid moiety 
which is esterified to a phospholipid such as 
phosphatidylcholine. However, it is also possible that 
monounsaturated fatty acids esterified to 

15 phosphatidylethanolamine, phosphatidic acid or a neutral lipid 

such as diacylglycerol or a Coenzyme-A thioester may also be 
substrates. As noted ,above, significant activity has been 
observed in radioactive labelling studies using fatty acyl 
substrates other than oleate (Howling et al., 1972) indicating 

20 that the substrate specificity is for a family of related 

fatty acyl compounds. Because the castor hydroxylase 
introduces hydroxy groups three carbons from a double bond, 
proximal to the methyl carbon of the fatty acid we term the 
enzyme a kappa hydroxylase for convenience. Of particular 

25 interest, we envision that the castor kappa hydroxylase may be 

used for production of l2~hydroxy-9-octadecenoic acid 
(ricinoleate) , l2-hydroxy-9-hexadecenoic acid, 14-hydroxy-ll- 
eicosenoic acid, l6-hydroxy-13-docosanoic acid, 9-hydroxy-6- 
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octadecenoic acid by expression in plants species which 
produce the non-hydroxylated precursors. We also envisi n 
production of additionally modified fatty acids such as 12- 
hydroxy-9 , 15-octadecadienoic acid that result from 
desaturation of hydroxylated fatty acids (e.g., 12-hydroxy-9- 
octadecenoic acid in this example) . 

We also envision that future advances in the genetic 
engineering of plants will lead to production of substrate 
fatty acids, such as icosenoic acid esters, and palmitoleic 
acid esters in plants that do not normally accumulate such 
fatty acids. We envision that the invention described herein 
may be used in conjunction with such future improvements to 
produce hydroxylated fatty acids of this invention in any 
plant species that is amenable to directed genetic 
modification. Thus, the applicability of this invention is 
not limited in our conception only to those species that 
currently accumulate suitable substrates. 

As noted above, a plant kappa hydroxylase of this 
invention will display activity towards various fatty acyl 
substrates. During biosynthesis of lipids in a plant cell, 
fatty acids are typically covalently bound to acyl carrier 
protein (ACP) , coenzyme A (CoA) or various cellular lipids. 
Plant kappa hydroxylases which display preferential activity 
toward lipid-linked acyl substrate are especially preferred 
because they are likely to be closely associated with normal 
pathway of storage lipid synthesis in immature embryos. 
However, activity toward acyl-CoA substrates or other 
synthetic substrates, for example, is also contemplated 
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herein. 

Other plant kappa hydroxylases are obtainable from the 
specific exemplified sequences provided herein. Furthenn re, 
it will be apparent that one can obtain natural and synthetic 
plant kappa hydroxylases including modified amino acid 
sequences and starting materials for synthetic-protein 
modeling from the exemplified plant kappa hydroxylase and from 
plant kappa hydroxylases which are obtained through the use of 
such exemplified sequences. Modified amino acid sequences 
include sequences which have been mutated, truncated, 
increased and the like, whether such sequences were partially 
or wholly synthesized. Sequences which are actually purified 
from plant preparations or are identical or encode identical 
proteins thereto, regardless of the method used to obtain the 
protein or sequence, are equally considered naturally derived. 

Thus, one skilled in the art will readily recognize that 
antibody preparations, nucleic acid probes (DNA and RNA) and 
the like may be prepared and used to screen and recover 
"homologous" or "related" kappa hydroxylases from a variety of 
plant sources. Typically, nucleic acid probes are labeled to 
allow detection, preferably with radioactivity although 
enzymes or other methods may also be used. For immunological 
screening methods, antibody preparations either monoclonal or 
polyclonal are utilized. Polyclonal antibodies, although less 
specific, typically are more useful in gene isolation. For 
detection, the antibody is labeled using radioactivity or any 
one of a variety of second antibody/enzyme conjugate systems 
that are commercially available. 
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H mologous s quences ar found when there is an id ntity 
of sequence and may be determined upon comparison of sequence 
information, nucleic acid or amino acid, or through 
hybridization reactions between a known kappa hydroxylase and 
a candidate source. Conservative changes, such as Glu/Asp, 
Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered 
in determining sequence homology. Typically, a lengthy 
nucleic acid sequence may show as little as 50-60% sequence 
identity, and more preferably at least about 70% sequence 
identity, between the target sequence and the given plant 
kappa hydroxylase of interest excluding any deletions which 
may be present, and still be considered related. Amino acid 
sequences are considered homologous by as little as 25% 
sequence identity between the two complete mature proteins. 
15 (See generally, Doolittle, R.F., OF URFS and ORFS, University 

Science Books, CA, 1986.) 

A genomic or other appropriate, library prepared from the 
candidate plant source of interest may be probed with 
conserved sequences from the plant kappa hydroxylase to 
20 identify homologously related sequences. Use of an entire 

cDNA or other sequence may be employed if shorter probe 
sequences are not identified. Positive clones are then 
analyzed by restriction enzyme digestion and/or sequencing. 
When a genomic library is used, one or more sequences may be 
25 identified providing both the coding region, as well as th 

transcriptional regulatory elements of the kappa hydroxylase 
gene from such plant source. Probes can also be considerably 
shorter than the entire sequence. Oligonucleotides may be 
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used, for example, but should be at least about 10, preferably 
at least abou^ 15, more preferably at least 20 nucleotides in 
length. When shorter length regions are used for comparison, 
a higher degree of sequence identity is required than for 
5 longer sequences. Shorter probes are often particularly 

useful for polymerase chain reactions (PCR) , especially when 
highly conserved sequences can be identified (See Gould, et 
al., 1989 for examples of the use of PCR to isolate homologous 
genes from taxonomically diverse species) . 

10 When longer nucleic acid fragments are employed (>100 bp) 

as probes, especially when using complete or large cDNA 
sequences, one would screen with low stringencies (for 
example, 40-50°C below the melting temperature of the probe) 
in order to obtain signal from the target sample with 20-50% 

15 deviation, i.e., homologous sequences. (Beltz, et ai . 1983). 

In a preferred embodiment, a plant kappa hydroxylase of 
this invention will have at least 60% overall amino acid 
sequence similarity with the exemplified plant kappa 
hydroxylase. In particular, kappa hydroxylases which are 

20 obtainable from an amino acid or nucleic acid sequence of a 

castor or lesquerella kappa hydroxylase are especially 
preferred. The plant kappa hydroxylases may have preferential 
activity toward longer or shorter chain fatty acyl substrates. 
Plant fatty acyl hydroxylases having oleate-l2-hydroxylase 

25 activity and eicosenoate- 14 -hydroxylase activity are both 

considered homologously related proteins because of in vitr 
evidence (Howling et al., 1972), and evidence disclosed 
herein, that the castor kappa hydroxylase will act on both 
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substrates. Hydroxy lated fatty acids may be subject to 
further enzymatic modification by other enzymes which are 
normally present or are introduced by genetic engineering 
methods. For example, 14-hydroxy-il , 17-eicosadienoic acid, 
which is present in some Lesquerella species (Smith 1985) , is 
thought to be produced by desaturation of 14 -hydroxy- 11- 
eicosenoic acid. 

Again, not only can gene clones and materials derived 
thereof be used to identify homologous plant fatty acyl 
hydroxylases, but the resulting sequences obtained therefrom 
may also provide a further method to obtain plant fatty acyl 
hydroxylases from other plant sources. In particular, PCR may 
be a useful technique to obtain related plant fatty acyl 
hydroxylases from sequence data provided herein. One skilled 
in the art will be able to design oligonucleotide probes based 
upon sequence comparisons or regions of typically highly 
conserved sequence. Of special interest are polymerase chain 
reaction primers based on the conserved regions of amino acid 
sequence between the castor kappa hydroxylase and the L. 
fendleri hydroxylase (SED ID NO:4) . Details relating to the 
design and methods for a PCR reaction using these probes are 
described more fully in the examples. 

It should also be noted that the fatty acyl hydroxylases 
of a variety of sources can be used to investigate fatty acid 
hydroxylation events in a wide variety of plant and in vivo 
applications. Because all plants synthesize fatty acids via a 
common metabolic pathway, the study and/or application of one 
plant fatty acid hydroxylase to a heterologous plant host may 
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be readily achieved in a variety of species. 

Once the nucleic acid sequence is obtained, the 
transcription, or transcription and translation (expression) , 
of the plant fatty acyl hydroxylases in a host cell is desir d 
5 to produce a ready source of the enzyme and/ or modify the 

composition of fatty acids found therein in the form of free 
fatty acids, esters (particularly esterified to glycerolipids 
or as components of wax esters) , estolides, or ethers. Other 
useful applications may be found when the host cell is a plant 
10 host cell, in vitro and in vivo, 

For example, by increasing the amount of an kappa 
hydroxylase available to the plant, an increased percentag of 
ricinoleate or lesqueroleate ( 14-hydroxy-ll-eicosenoic acid) 
may be provided. 

15 

Kappa hydroxylase 



By this invention, a mechanism for the biosynthesis of 
ricinoleic acid in plants is demonstrated. Namely, that a 

20 specific plant kappa hydroxylase having preferential activity 
toward fatty acyl substrates is involved in the accumulation 
of hydroxy la ted fatty acids in at least some plant species. 
The use of the terms ricinoleate or ricinoleic acid (or 
lesqueroleate or lesquerolic acid, densipoleate etc.) is 

25 intended to include the free acids, the ACP and CoA esters, 

the salts of these acids, the glycero lipid esters 
(particularly the triacylglycerol esters) , the wax esters, the 
estolides and the ether derivatives of these acids. 

- 23 - 



BNSDOCID: <WO 9610075A1J_> 



WO 96/10075 



PCI7US95/U855 



The determination that plant fatty acyl hydroxylases are 
activ in the In vivo production of hydroxylated fatty acids 
suggests several possibilities for plant enzyme sources. And 
in fact, hydroxylated fatty acids are found in some natural 
plant species in abundance. For example, three hydroxy fatty 
acids related to ricinoleate occur in major amounts in seed 
oils from various Lesquerella species. Of particular 
interest, lesquerolic acid is a 20 carbon homolog of 
ricinoleate with two additional carbons at the carboxyl end of 
the chain (Smith 198 5) . Other natural plant sources of 
hydroxylated fatty acids include but are not limited to seeds 
of the Linum genus, seeds of Wrightia species, Lycopodium 
species, Strophanthus species, Convolvulaces species, 
Calendula species and many others (van de Loo et al., 1993). 

Plants having significant presence of ricinoleate or 
lesqueroleate or desaturated other or modified derivatives of 
these fatty acids are preferred candidates to obtain 
naturally-derived kappa hydroxylases. For example, 
Lesguerelia densipila contains a diunsaturated 18 carbon fatty 
acid with a hydroxyl group (van de Loo et al. , 1993) that is 
thought to be produced by an enzyme that is closely related to 
the castor kappa hydroxylase, according to the theory on which 
this invention is based. In addition, a comparison between 
kappa hydroxylases and between plant fatty acyl hydroxylas s 
which introduce hydroxyl groups at positions other than the 
12-carbon of oleate or the 14-carbon of lesqueroleate or on 
substrates other than oleic acid and icosenoic acid may yield 
insights for protein modeling or other modifications to create 
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synthetic hydroxylases as discussed above. For example, on 
the basis of information gained from structural comparisons of 
the A 12 desaturases and the kappa hydroxylase, we envision 
making genetic modifications in the structural genes for A 12 
5 desaturases that convert these desaturases to kappa- 

hydroxylases. We also envision making changes in A15 
hydroxylases that convert these to hydroxylases with 
comparable substrate specificity to the desaturases (e.g., 
conversion of 18 :2^' 12 to 150H-18 : 2* 9 ' 12 . Since the difference 

10 between a hydroxylase and a desaturases concerns the 

disposition of one proton, we envision that by systematically 
changing the charged groups in the region of the enzyme near 
the active site, we can effect this change. 

Especially of interest are fatty acyl hydroxylases which 

15 demonstrate activity toward fatty acyl substrates other than 

oleate, or which introduce the hydroxy 1 group at a location 
other than the C12 carbon. As described above, other plant 
sources may also provide sources for these enzymes through the 
use of protein purification, nucleic acid probes, antibody 

20 preparations, protein modeling, or sequence comparisons, for 

example, and of special interest are the respective amino acid 
and nucleic acid sequences corresponding to such plant fatty 
acyl hydroxylases. Also as previously described, once a 
nucleic acid sequence is obtained for the given plant 

25 hydroxylase, further plant sequences may be compared and/or 

probed to obtain homologously related DNA sequences thereto 
and so on. 
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Gepetic Engineering Applications 

As is well known in the art, once a cDKA clone encoding a 
plant kappa hydroxylase is obtained, it nay be used to obtain 
its corresponding genomic nucleic acid sequences thereto* 

The nucleic acid sequences which encode plant kappa 
hydroxylases may be used in various constructs, for exampl , 
as probes to obtain further sequences from the same or other 
species. Alternatively, these sequences may be used in 
conjunction with appropriate regulatory sequences to increase 
levels of the respective hydroxylase of interest in a host 
cell for the production of hydroxylated fatty acids or study 
of the enzyme in vitro or in vivo or to decrease or increase 
levels of the respective hydroxylase of interest for some 
applications when the host cell is a plant entity, including 
plant cells, plant parts (including but not limited to seeds, 
cuttings or tissues) and plants. 

A nucleic acid sequence encoding a plant kappa 
hydroxylase of this invention may include genomic, cDNA or 
mRHA sequence. By "encoding" is meant that the sequence 
corresponds to a particular amino acid sequence either in a 
sense or anti-sense orientation. By "recombinant" is meant 
that the sequence contains a genetically engineered 
modification through manipulation via mutagenesis, restriction 
enzymes, and the like. A cDNA sequence may or may not encode 
pre-processing sequences, such as transit or signal peptide 
sequences. Transit or signal peptide sequences facilitate the 
delivery of the protein to a given organelle and are 
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frequently cleaved from the polypeptide upon entry into the 
organelle, releasing the "mature" sequence. The use of the 
precursor DNA sequence is preferred in plant cell expression 
cassettes. 

5 Furthermore, as discussed above the complete genomic 

sequence of the plant kappa hydroxylase may be obtained by th 
screening of a genomic library with a probe, such as a cDNA 
probe, and isolating those sequences which regulate expression 
in seed tissue. In this manner, the transcription and 

10 translation initiation regions, introns, and/or transcript 

termination regions of the plant kappa hydroxylase may be 
obtained for use in a variety of DNA constructs, with or 
without the kappa hydroxylase structural gene. Thus, nucleic 
acid sequences corresponding to the plant kappa hydroxylase of 

15 this invention may also provide signal sequences useful to 

direct transport into an organelle 5' upstream non-coding 
regulatory regions (promoters) having useful tissue and timing 
. profiles, 3 1 downstream non-coding regulatory region useful as 
transcriptional and translational regulatory regions and may 

20 lend insight into other features of the gene. 

Once the desired plant kappa hydroxylase nucleic acid 
sequence is obtained, it may be manipulated in a variety of 
ways. Where the sequence involves non-coding flanking 
regions, the flanking regions may be subjected to resection, 

25 mutagenesis, etc. Thus, transitions, transversions, 

deletions, and insertions may be performed on the naturally 
occurring sequence. In addition, all or part of the sequence 
may be synthesized. In the structural gene, one or more 
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codons may be m dif ied to provide for a modified amino acid 
sequence, or one or more codon mutations may be introduc d to 
provide for a convenient restriction site or other purpose 
involved with construction or expression. The structural gene 
may be further modified by employing synthetic adapters, 
linkers to introduce one or more convenient restriction sites, 
or the like. 

The nucleic acid or amino acid sequences encoding a plant 
kappa hydroxylase of this invention may be combined with other 
non-native, or "heterologous " , sequences in a variety of ways. 
By "heterologous" sequences is meant any sequence which is not 
naturally found joined to the plant kappa hydroxylase, 
including, for example, combination of nucleic acid sequ nces 
from the same plant which are not naturally found joined 
together . 

The DNA sequence encoding a plant kappa hydroxylase of 
this invention may be employed in conjunction with all or part 
of the gene sequences normally associated with the kappa 
hydroxylase. In its component parts, a DNA sequence encoding 
kappa hydroxylase is combined in a DNA construct having, in 
the 5' to 3* direction of transcription, a transcription 
initiation control region capable of promoting transcription 
and translation in a host cell, the DNA sequence encoding 
plant kappa hydroxylase and a transcription and translation 
termination region. 

Potential host cells include both prokaryotic and 
eukaryotic cells. A host cell may be unicellular or found in 
a multic llular differentiated or undifferentiated organism 
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depending upon the intended use. Cells of this invention may 
be distinguish d by having a plant kappa hydroxylase foreign 
to the wiid-type cell present therein, for example, by having 
a recombinant nucleic acid construct encoding a plant kappa 
5 hydroxylase therein. 

Depending upon the host, the regulatory regions will 
vary, including regions from viral, plasmid or chromosomal 
genes, or the like* For expression in prokaryotic or 
eukaryotic microorganisms , particularly unicellular hosts, a 

10 wide variety of constitutive or regulatable promoters may be 

employed. Expression in a microorganism can provide a ready 
source of the plant enzyme. Among transcriptional initiation 
regions which have been described are regions from bacterial 
and yeast hosts, such as J5\ coll, B . subtilis, Saccharomyces 

15 cerevisiae , including genes such as beta-galactosidase , T7 

polymerase, tryptophan E and the like. 

For the most part, the constructs will involve regulatory 
regions functional in plants which provide for modified 
production of plant kappa hydroxylase with resulting 

20 modification of the fatty acid composition. The open reading 

frame, coding for the plant kappa hydroxylase or functional 
fragment thereof will be joined at its 5 1 end to a 
transcription initiation regulatory region such as the wild- 
type sequence naturally found 5' upstream to the kappa 

25 hydroxylase structural gene. Numerous other transcription 

initiation regions are available which provide for a wide 
variety of constitutive or regulatable, e.g., inducible, 
transcription of the structural gene functions. Among 
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transcriptional initiation regions used for plants are such 
r gions associated with the structural genes such as for 
nopaline and mannopine synthases, or with napin, soybean p- 
conglycinin, oleosin, 12S storage protein, the cauliflower 
5 mosaic virus 35S promoters and the like- The transcripti n/ 

translation initiation regions corresponding to such 
* structural genes are found immediately 5' upstream to th 
respective start codons. In embodiments wherein the 
expression of the kappa hydroxylase protein is desired in a 
10 plant host, the use of all or part of the complete plant kappa 

hydroxylase gene is desired; namely all or part of the 5* 
upstream non-coding regions (promoter) together with the 
structural gene sequence and 3 1 downstream non-coding regions 
may be employed. If a different promoter is desired, such as 
15 a promoter native to the plant host of interest or a modified 

promoter, i.e. , having transcription initiation regions 
derived from one gene source and translation initiation 
regions derived from a different gene source, including the 
sequence encoding the plant kappa hydroxylase of interest, or 
20 enhanced promoters, such as double 35S CaMV promoters, the" 

sequences may be joined together using standard techniques. 

For such applications when 5' upstream non-coding regions 
are obtained from other genes regulated during seed 
maturation, those preferentially expressed in plant embryo 
25 tissue, such as transcription initiation control regions fr m 

the B . napus napin gene, or the Arabidopsis 12S storage 
protein, or soybean 0-conglycinin (Bray et al., 1987), or the 
L. fendleri kappa hydroxylase promoter described herein are 
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desired. Transcription initiation regions which are 
preferentially expressed in seed tissue, i.e., which are 
undetectable in other plant parts, are considered desirable 
for fatty acid modifications in order to minimize any 
5 disruptive or adverse effects of the gene product. 

Regulatory transcript termination regions may be provid d 
in DKA constructs of this invention as well. Transcript 
termination regions may be provided by the DNA sequence 
encoding the plant kappa hydroxylase or a convenient 

10 transcription termination region derived from a different gene 

source, for example, the transcript termination region which 
is naturally associated with the transcript initiation region. 
Where the transcript termination region is from a different 
gene source, it will contain at least about 0.5 kb, preferably 

15 about 1-3 kb of sequence 3* to the structural gene from which 

the termination region is derived. 

Plant expression or transcription constructs having a 
plant kappa hydroxylase as the DNA sequence of interest for 
increased or decreased expression thereof may be employed with 

20 a wide variety of plant life, particularly, plant life 

involved in the production of vegetable oils for edible and 
industrial uses. Most especially preferred are temperate 
oilseed crops. Plants of interest include, but are not 
limited to rapeseed (Canola and high erucic acid varieties) f 

25 Crambe, Brassica juncea, Brassica nigra, meadowfoaro, flax, 

sunflower, saf flower, cotton, Cuphea, soybean, peanut, coconut 
and oil palms and corn. An important criterion in the 
selection of suitable plants for the introduction on the kappa 
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hydroxylas is the presence in the host plant of a suitable 
substrate for the hydroxylase. Thus, for example, product! n 
of ricinoleic acid will be best accomplished in plants that 
normally have high levels of oleic acid in seed lipids. 
5 Similarly, production of lesquerolic acid will best be 

accomplished in plants that have high levels of icosenoic acid 
in seed lipids. 

Depending on the method for introducing the recombinant 
constructs into the host cell, other DNA sequences may be 

10 required. Importantly/ this invention is applicable to 

dicotyledons and monocotyledons species alike and will be 
readily applicable to new and/or improved transformation and 
regulation techniques. The method of transformation is n t 
critical to the current invention; various methods of plant 

15 transformation are currently available. As newer methods are 

available to transform crops, they may be directly appli d 
hereunder. For example, many plant species naturally 
susceptible to Agrobacteriura infection may be successfully 
transformed via tripartite or binary vector methods of 

2 0 Agrobacterium mediated transformation. In addition, 

techniques of microinjection, DNA particle bombardment, 
electroporation have been developed which allow for the 
transformation of various monocot and dicot plant species. 

In developing the DNA construct, the various components 

25 of the construct or fragments thereof will normally be 

inserted into a convenient cloning vector which is capabl of 
replication in a bacterial host, e.g., E. colx. Numerous 
vectors exist that have been described in the literature. 
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After each cloning, the plasmid may be isolated and subjected 
to further manipulation, such as restriction, insertion of new 
fragments, ligation, deletion, insertion, resection, etc., so 
as to tailor the components of the desired sequence. Once the 
5 construct has been completed, it may then be transferred to an 

appropriate vector for further manipulation in accordance with 
the manner of transformation of the host cell. 

Normally, included with the DNA construct will be a 
structural gene having the necessary regulatory regions for 

10 expression in a host and providing for selection of 

transformant cells. The gene may provide for resistance to a 
cytotoxic agent, e.g., antibiotic, heavy metal, toxin, etc., 
complementation providing prototropy to an auxotrophic host, 
viral immunity or the like. Depending upon the number of 

15 different host species the expression construct or components 

thereof are introduced, one or more markers may be employed, 
where different conditions for selection are used for the 
different hosts. 

It is noted that the degeneracy of the DNA code provides 

2 0 that some codon substitutions are permissible of DNA sequenc s 

without any corresponding modification of the amino acid 
sequence. 

As mentioned above, the manner in which the DNA construct 
is introduced into the plant host is not critical to this 
25 invention. Any method which provides for efficient 

transformation may be employed. Various methods for plant 
cell transformation include the use of Ti- or Ri-plasmids, 
microinjection, electroporation, infiltration, imbibition, DNA 
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particle bombardment, liposome fusion, DNA bombardment or the 
like. In many instances, it will be desirable to have the 
construct bordered on one or both sides of the T-DNA, 
particularly having the left and right borders, more 
particularly the right border. This is particularly useful 
when the construct uses A. tumefaciens or A. rhizogenes as a 
mode for transformation, although the T-DNA borders may find 
use with other modes of transformation. 

Where Agrobacterium is used for plant cell 
transformation, a vector may be used which may be introduced 
into the Agrobacterium host for homologous recombination with 
T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium 
host. The Ti- or Ri-plasmid containing the T-DNA for 
recombination may be armed (capable of causing gall formation) 
or disarmed (incapable of causing gall) , the latter being 
permissible, so long as the vir genes are present in the 
transformed Agrobacterium host. The armed plasmid can give a 
mixture of normal plant cells and gall. 

In some instances where Agrojbacteriu/n is used as the 
vehicle for transforming plant cells, the expression construct 
bordered by the T-DNA border (s) will be inserted into a broad 
host spectrum vector, there being broad host spectrum vectors 
described in the literature. Commonly used is pRK2 or 
derivatives thereof. See, for example, Ditta et al., (1980), 

Included with the 
expression construct and the T-DNA will be one or more 
markers, which allow for selection of transformed 
Agrobacterium and transformed plant cells. A number of 
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markers have been developed for use with plant: cells, such as 
resistance to kanamycin, the aminoglycoside G418, hygromycin, 
or the like. The particular marker employed is not essential 
to this invention, one or another marker being preferred 
5 depending on the particular host and the manner of 

construct ion . 

For transformation of plant cells using Ag\roJbactex*ium, 
explants may be combined and incubated with the transformed 
Agrobacterium for sufficient time for transformation, the 

10 bacteria killed, and the plant cells cultured in an 

appropriate selective medium. Once callus forms, shoot 
formation can be encouraged by employing the appropriate plant 
hormones in accordance with known methods and the shoots 
transferred to rooting medium for regeneration of plants. The 

15 plants may then be grown to seed and the seed used to 

establish repetitive generations and for isolation of 
vegetable oils. 

The invention now being generally described, it will be 
more readily understood by reference to the following exampl s 

20 which are included for purposes of illustration only and are 

not intended to limit the present invention. 

EXAMPLES 

In the experimental disclosure which follows, all 
25 temperatures are given in degrees centigrade (°), weights are 

given in grams (g) , milligram (mg) or micrograms (/*g) , 
concentrations are given as molar (M) , millimolar (mM) or 
micromolar {/M) and all volumes are given in liters (1) , 
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microliters (^1) or milliliters (ml), unless otherwise 
indicated. 



5 EXAMPLE 1 - PRODUCTION OF NOVEL HYDROXYLATED FA TTY ACIDS IN 

ARABIDOPSIS THALIANA 
Overview 

The kappa hydroxylase encoded by the previously described 
fahl2 gene from Castor (U.S. Patent application 08/320,982) 
10 was used to produce ricinoleic acid, lesquerolic acid, 

densipolic acid and auricolic acid in transgenic Arabidopsis 
plants. This example reduces to practice the method taught in 
Example 2 of the foregoing application. 

Production of transgenic plants 

A variety of methods have been developed to insert a DNA 
sequence of interest into the genome of a plant host to obtain 
the transcription and translation of the sequence to eff ct 
phenotypic changes. The following methods represent only one 
of many equivalent means of producing transgenic plants and 
causing expression of the hydroxylase gene. 

Arabidopsis plants were transformed, by Agrobacterium- 
mediated transformation, with the kappa hydroxylase encoded by 
the Castor fan 12 gene on binary Ti plasmid pB6. This plasmid 
was previously used to transform Nicotiana tabacum for the 
production of ricinoleic acid (U.S. Patent application 
08/320,982). 

Inoculums of AgrroJbacterium tujnefaciens strain GV3101 
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containing binary Ti pi asm id pB6 wer plated on L-br th plates 
containing 50 Mg/ml Kanamycin and incubated for 2 days at 
30°C. Single colonies were used to inoculate large liquid 
cultures (L-broth medium with 50 mg/1 rifampicin, 110 mg/1 
5 gentamycin and 200 mg/1 kanamycin) to be used for the 

transformation of Arabidopsis plants. 

Arabidopsis plants were transformed by the in plant* 
transformation procedure essentially as described by Bechtold 
et al., (1993). Cells of A. tumefaclens GV3101(pB6) were 

10 harvested from liquid cultures by centrif ugation, then 

resuspended in infiltration medium at OD^ - 0.8 (Infiltration 
medium was Murashige and Skoog macro and micronutrient medium 
(Sigma Chemical Co., St. Louis, MO) containing 10 mg/1 
6-benzylaminopurine and 5% glucose) . Batches of 12-15 plants 

15 were grown for 3 to 4 weeks in natural light at a mean daily 

temperature of approximately 25 °C in 3.5 inch pots containing 
soil. The intact plants were immersed in the bacterial 
suspension then transferred to a vacuum chamber and placed 
under 600 mm of vacuum produced by a laboratory vacuum pump 

2 0 until tissues appeared uniformly water-soaked (approximately 

10 min) . The plants were grown at 25 °C under continuous light' 
(100 jimol m* 2 s* 1 irradiation in the 4 00 to 700 nm range) for 
four weeks. The seeds obtained from all the plants in a p t 
were harvested as one batch. The seeds were sterilized by 

25 sequential treatment for 2 min with ethanol followed by 10 min 

in a mixture of household bleach (Chlorox) , water and Tween-8 0 
(50%, 50%, 0.05%) then rinsed thoroughly with sterile water. 
The seeds were plated at high density (2000 to 4000 per plate) 
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onto agar-solidif ied medium in 100 mm petri plates containing 
1/2 X Murashige and Skoog salts medium enriched with B5 
vitamins (Sigma Chemical Co., St. Louis, MO) and containing 
kanamycin at 50 mg/1. After incubation for 48 h at 4°C to 
stimulate germination, seedlings were grown for a period of 
seven days until transf ormants were clearly identifiable as 
healthy green seedlings against a background of chlorotic 
kanamycin-sensitive seedlings. The transf ormants were 
transferred to soil for two weeks before leaf tissue could be 
used for DNA and lipid analysis. More than 20 transf ormants 
were obtained. 

DNA was extracted from young leaves from transf ormants to 
verify the presence of an intact fahl2 gene. The presence of 
the transgene in a number of the putative transgenic lines was 
verified by using the polymerase chain reaction to amplify the 
insert from pB6. The primers used were HF2 = 
GCTCTTTTGTGCGCTCATTC (SEQ ID NO: 12) and HR1'= 

CGGTACCAGAAAACGCCTTG (SEQ ID NO: 13), which were designed to. 
allow the amplification of a 700 bp fragment. Approximat ly 
100 ng of genomic DNA was added to a solution containing 2 5 
pmol of each primer, 1.5 U Taq polymerase (Boehringer 
Manheim) , 200 uM of dNTPs, 50 mM KC1, 10 mM Tris.Cl (pH 9), 
0.1% (v/v) Triton X-100, 1.5 mM MgCl 2 , 3% (v/v) formamide, to a 
final volume of 50 pi. Amplifications conditions were: 4 min 
denaturation step at 94 °C, followed by 30 cycles of 92 °C for 1 
min, 55°C for 1 min, 72°C for 2 min. A final extension step 
closed the program at 72 °C for 5 min. Transf ormants could be 
positively identified after visualization of a characteristic 
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1 Xb amplified fragment on an ethidium bromide stained agarose 
gel. All transg nic lines tested gave a PCR product of a siz 
consistent with the expected genotype, confirming that the 
lines were, indeed, transgenic. All further experiments were 
5 done with three representative transgenic lines of the wild 

type designated as 1-3, 4D, 7-4 and one transgenic line of the 
fad2 mutant line JB12 . The transgenic JB12 line was included 
in order to test whether the increased accumulation of oleic 
acid in this mutant would have an effect on the amount of 
10 ricinoleic acid that accumulated in the transgenic plants. 



Analysis of transgenic plants 

Leaves and seeds from fahl2 transgenic Arabidopsis plants 
were analyzed for the presence of hydroxylated fatty acids 

15 using gas chromatography. Lipids were extracted from 100-200 

mg leaf tissue or 50 seeds. Fatty acid methyl esters (FAMES) 
were prepared by placing tissue in 1.5 ml of 1.0 M methanolic 
HC1 (Supelco Co.) in a 13 x 100 mm glass screw-cap tube capped 
with a teflon-lined cap and heated to 80 °C for 2 hours. Upon 

20 cooling, 1 ml petroleum ether was added and the FAMES removed 

by aspirating off the ether phase which was then dried under a 
nitrogen stream in a glass tube. One hundred /il of 
N,0-bis(Trimethylsilyl) trif luoroacetamide (BSTFA; Pierce 
Chemical Co) and 200 fil acetonitrile was added to derivatize 

25 the hydroxyl groups. The reaction was carried out at 70 °C for 

15 min. The products were dried under nitrogen, redissolved 
in 100 /xl chloroform and transferred to a gas chromatograph 
vial. Two Ml of each sample were analyzed on a SP2340 fused 
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silica capillary column (30 m, 0.75 mm ID, 0.20 mm film, 
Supelco) , using a Hewlett-Packard 5890 II series Gas 
Chromatograph. The samples were not split, the temperature 
program was 195°C for 18 min, increased to 230°C at 25°C/min, 
5 held at 230°C for 5 min then down to 195°C at 25°C/min., and 

flame ionization detectors were used. 

The chromatographic elution time of methyl esters and O- 
TMS derivatives of ricinoleic acid, lesquerolic acid and 
auricolic acid was established by GC-MS of lipid samples fr m 

10 seeds of L . fendleri and comparison to published chromatograms 

of fatty acids from this species (Carlson et al., 1990). A O- 
TMS-methyl-ricinoleate standard was prepared from ricinoleic 
acid obtained from Sigma Chemical Co (St, Louis, MO). 0-TMS- 
methyl-lesqueroleate and O-TMS-methyl-auricoleate standards 

15 were prepared from triacylglycerols purified from seeds of L. 

fendleri. The mass spectrum of O-TMS-methyl-ricinoleate, O- 
TMS-methyl-densipoleate, O-TMS-methyl-lesqueroleate, and 0- 
TMS-methyl-auricoleate are shown in Figures 1A-D, 
respectively. The structures of the characteristic ions 

20 produced during mass spectrometry of these derivatives are 

shown in Figure 2. 

Lipid extracted from transgenic tissues were analyzed by 
gas chromatography and mass spectrometry for the presence of 
hydroxy lated fatty acids. As a matter of reference, the 

25 average fatty acid composition of leaves in Arabidopsis wild 

type and fad2 mutant lines was reported by Miquel and Browse 
(1992). Gas chromatograms of methylated and silylated fatty 
acids from seeds of wild type and a fahl2 transgenic wild type 
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plant are shown in Figures 3A and 3B, r spectively. The 
profiles are very similar except for the presence of three 
small but distinct peaks at 14.3, 15.9 and 18.9 minutes. A 
very small peak at 20.15 min was also evident. The elution 
time of the peaks at 14.3 and 18.9 min corresponded precisely 
to that of comparably prepared ricinoleic and lesquerolic 
standards, respectively. No significant differences were 
observed in lipid extracts from leaves or roots of the wild 
type and the fahl2 transgenic wild type lines (Table 1) . 
Thus, in spite of the fact that the fahl2 gene is expressed 
throughout the plant, we observed effects on fatty acid 
composition only in seed tissue. A similar observation was 
described previously for transgenic fahl2 tobacco in patent 
application No. 08/320,982. 
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Table 1 : Fatty acid composition of lipids from transgenic and wild type Arabidopsis. The values are 
the means obtained from analysis of samples from three independent transgenic lines, or three 
independent samples of wild type and fad2 lines. 
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In order to confirm that the observed new peaks in the 
transgenic lines corresponded to derivatives of ricinoleic, 
lesquerolic, densipolic and auricolic acids, mass spectrometry 
was used. The fatty acid derivatives were resolved by gas 
5 chromatography as described above except that a 

Hewlett-Packard 5971 series mass selective detector was used 
in place of the flame ionization detector used in the previous 
experiment. The spectra of the four new peaks in Figure 3B 
(peak numbers 10, 11, 12 and 13) are shown in Figures 4A-D, 

10 respectively. Comparison of the spectrum obtained for the 

standards with that obtained for the four peaks from the 
transgenic lines confirms the identity of the four new peaks. 
On the basis of the three characteristic peaks at M/Z 187, 270 
and 299, peak 10 is unambiguously identified as O-TMS- 

15 methylricinoleate. On the basis of the three characteristic 

peaks at M/Z 185, 270 and 299, peak 11 is unambiguously 
identified as O-TMS-methyldensipoleate. On the basis of the 
three characteristic peaks at M/Z 187, 298 and 327, peak 12 is 
unambiguously identified as O-TMS-methyllesqueroleate. On the 

2 0 basis of the three characteristic peaks at M/Z 18 5, 298 and 

327, peak 13 is unambiguously identified as O-TMS- 
methylauricoleate. These results unequivocally demonstrate 
the identity of the fahl2 cDNA as encoding a hydroxylase that 
hydroxylat.es both oleic acid to produce ricinoleic acid and 

25 also hydroxylate icosenoic acid to produce lesquerolic acid. 

These results also provide additional evidence that the 
hydroxylase can be functionally expressed in a heterologous 
plant species in such a way that the enzym is catalytically 
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functional. These results also demonstrate that expression of 
this hydroxylase gene leads to accumulation of ricinoleic, 
lesquerolic, densipolic and auricolic acids in a plant species 
that does not normally accumulate hydroxy lated fatty acids in 
5 extractable lipids. 

The presence of lesquerolic acid in the transgenic plants 
was anticipated in the previous patent application (No. 
08/320,982) based on the biochemical evidence suggesting broad 
substrate specificity of the kappa hydroxylase. By contrast, 

10 the accumulation of densipolic and auricolic acids was less 

predictable. Since Arabidopsis does not normally contain 
significant quantities of the non-hydroxy lated precursors of 
these fatty acids which could serve as substrates for the 
hydroxylase, it appears that one or more of the three n-3 

15 fatty acid desaturases known in Arabidopsis (eg., f ad3 , f ad7 , 

fad8; reviewed in Gibson et al., 1995) are capable of 
desaturating the hydroxylated compounds at the n-3 position. 
That is, densipolic acid is produced by the action of an nr3 
desaturase on ricinoleic acid. Auricolic acid is produced by 

20 the action of an n-3 desaturase on lesquerolic acid. Because 

it is located in the endoplasmic reticulum, the fad3 
desaturase is almost certainly responsible. This can be 
tested in the future by producing f ah!2-containing transgenic 
plants of the f ad3-def icient mutant of Arabidopsis (similar 

25 experiments can be done with fad7 and fad8) . It is also 

formally possible that the enzymes that normally elongate 
18:l cia * to 20:l c1fi * 11 may elongate 120H-18 : l cisA9 to 14OH-20: l cisA11 , 
and 120H-18:2 cis *' 15 to 14OH-20 : 2 ci5d11 ' 17 . 
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The amount of th various fatty acids in seed, 1 af and 
root lipids of the control and transgenic plants is presented 
in Table 1. Although the amount of hydroxy lated fatty acids 
produced in this example is less than desired for commercial 
5 production of ricinoleate and other hydroxylated fatty acids 

from plants, we envision numerous improvements of this 
invention that will increase the level of accumulation of 
hydroxylated fatty acids in plants that express the fahl2 or 
related hydroxylase genes. Improvements in the level and 
10 tissue specificity of expression of the hydroxylase gene is 

envisioned. Methods to accomplish this by the use of strong, 
seed-specific promoters such as the B . napus napin promoter or 
the native promoters of the castor fahl2 gene or the 
corresponding hydroxylase gene from X. fendleri will be 
15 obvious to one skilled in the art. Additional improvements 

are envisioned to involve modification of the enzymes which 
cleave hydroxylated fatty acids from phosphatidylcholine, 
reduction in the activities of enzymes which degrade 
hydroxylated fatty acids and replacement of acyltransf erases 
2 0 which transfer hydroxylated fatty acids to the sn-1, sn-2 and 

sn-3 positions of glycerolipids . Although genes for these 
enzymes have not been described in the scientific literature, 
their utility in improving the level of production of 
hydroxylated fatty acids can be readily envisioned based on 
2 5 the results of biochemical investigations of ricinoleate 

synthesis. 

Although Arabidopsis is not an economically important 
plant species, it is widely accepted by plant biologists as a 
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model for higher plants. Therefore, the inclusion of this 
example is intended to demonstrate the general utility of the 
invention described here 

to the modification of oil composition in higher 
5 plants. One advantage of studying the expression of this 

novel gene in Arabidopsis is the existence in this system of a 
large body of knowledge on lipid metabolism, as well as the 
availability of a collection of mutants which can be used to 
provide useful information on the biochemistry of fatty acid 

10 hydroxy lation in plant species. Another advantage is the ease 

of transposing any of the information obtained on metabolism 
of ricinoleate in Arabidopsis to closely related species such 
as the crop plants Brasslca napus, Brassica juncea or Crambe 
Abyssinica in order to mass produce ricinoleate, lesqueroleate 

15 or other hydroxylated fatty acids for industrial use. The 

kappa hydroxylase is useful for the production of ricinoleate 
or lesqueroleate in any plant species that accumulates 
significant levels of the precursors, oleic acid and icosenoic 
acid. Of particular interest are genetically modified 

20 varieties that accumulate high levels of oleic acid. Such 

varieties are currently available for sunflower and Canola. 
Production of lesquerolic acid and related hydroxy fatty acids 
can be achieved in species that accumulate high levels of 
icosenoic acid or other long chain monoenoic acids. Such 

2 5 plants may in the future be produced by genetic engineering of 

plants that do not normally make such precursors. Thus, we 
envision that the use of the kappa hydroxylase is of general 
utility. 
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EXAMPLE 2. ISOLATION OF LESOUERELLA KAPPA HYDROXYLASE GENOMIC 

CLONE 

Overview 

5 Regions of nucleotide sequence that were conserved in both the 

Castor kappa hydroxylase and the Arabidopsis fad2 A12 fatty 
acid desaturase were used to design oligonucleotide primers. 
These were used with genomic DNA from Lesquerella fendleri to 
amplify fragments of several homologous genes. These 

10 amplified fragments were then used as hybridization probes to 

identify full length genomic clones from a genomic library of 
i. fendleri . Hydroxylated fatty acids are specific to the 
seed tissue of Lesquerella sp. , and are not found to any 
appreciable extent in vegetative tissues. One of the two 

15 genes identified by this method was expressed in both leaves 

and developing seeds and is therefore thought to correspond to 
the A12 fatty acid desaturase. The other gene was expressed 
at high levels in developing seeds but was not expressed or 
was expressed at very low levels in leaves and is the kappa 

20 hydroxylase from this species. The identity of this gene will 

be established by introducing the gene into transgenic 
Arabidopsis plants and showing that it causes the accumulation 
of ricinoleic acid, lesquerolic acid, densipolic acid and 
auricolic acid in seed lipids. The promoter of this gene is 

25 also of utility because it is able to direct expression of a 

gene specifically in developing seeds at a time when storage 
lipids are accumulating. This promoter is, therefore, of 
great utility for many applications in the genetic engineering 
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of seeds, particularly in members of the Brassicacea. 

The various steps involved in this process are described 
in detail below. Unless otherwise indicated, routine methods 
for manipulating nucleic acids, bacteria and phage were as 
5 described by Sambrook et al. (1989). 

Isolation of a fragment of the Lesouerella kappa hydroxylase 
gene 

Oligonucleotide primers for the amplification of the L, 
10 fendleri kappa hydroxylase were designed by choosing regions 
of high deduced amino acid sequence homology between the 
Castor kappa hydroxylase and the Arabidopsis Al2 desaturase 
(fad2). Because most amino acids are encoded several 
different codons, these oligonucleotides were designed to 
15 encode all possible codons that could encode the corresponding 

amino acids. The sequence of these mixed oligonucleotides was: 
Oligo 1: TAYWSNCAYMGNMGNCAYCA ( SEQ ID NO: 14) 
Oligo 2: RTGRTGNG CN ACRTGNGTRT C (SEQ ID NO: 15) 
(Where: Y = C+T; W = A+T; S = G+C; N = A+G+C+T; M = A+C; R « 
20 A+G) 

These oligonucleotides were used to amplify a fragment of 
DNA from i. fendleri genomic DNA by the polymerase chain 
reaction (PCR) using the following conditions: Approximately 
100 ng of genomic DNA was added to a solution containing 25 
25 pmol of each primer, 1.5 U Taq polymerase (Boehringer 

Manheim) , 200 uM of dNTPs, 50 mM KC1, 10 mM Tris.Cl (pH 9), 
0.1% (v/v) Triton X-100, 1.5 mM MgCl 2 , 3% (v/v) formamide f to a 
final volume of 50 jil. Amplifications conditions were: 4 min 
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denaturation step at 94°C, followed by 30 cycles of 92°C for 1 
min, 55°C for l min, 72°C for 2 min. A final extension step 
closed the program at 72 °C for 5 min. 

PCR products of approximately 540 bp were observed 
5 following electrophoretic separation of the products of the 

PCR reaction in agarose gels. Two of these fragments were 
cloned into pBluescript (Stratagene) to give rise to plasmids 
pLesq2 and pLesq3. The sequence of the inserts in these two 
plasmids was determined by the chain termination method. The 

10 sequence of the insert in pLesq2 is presented as Figure 5 (SEQ 

ID NO:l) and the sequence of the insert in pLesq3 is presented 
as Figure 6 (SEQ ID NO: 2). The high degree of sequence 
identity between the two clones indicated that they were both 
potential candidates to be either a A12 desaturase or a gama 

15 hydroxylase. 

Northern analysis 

In L. fendlerl, hydroxy lated fatty acids are found in 
large amounts in seed oils but are not found in appreciable 

2 0 amounts in leaves. Therefore, an important criterion in 

discriminating between a fatty acyl desaturase and kappa 
hydroxylase is that the kappa hydroxylase gene is expected to 
be expressed more highly in tissues which have high level of 
hydroxylated fatty acids than in other tissues whereas all 

25 plant tissues should contain mRNA for an 06 fatty acyl 

desaturase since diunsaturated fatty acids are found in the 
lipids of all tissues in most or all plants. Therefore, it 
was of great interest to determine whether the gene 
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corr sponding to pLesq2 was also expr ssed only in seeds, or 
is also expressed in other tissues. This qu stion was 
addressed by testing for hybridization of pLesq2 to RNA 
purified from developing seeds and from leaves. 
5 Total RNA was purified from developing seeds and young 

leaves of X. fendleri using an Rneasy RNA extraction kit 
(Qiagen) , according to the manufacturer's instructions. RNA 
concentrations were quantified by UV spectrophotometry at 
A.=260 and 280 nm. In order to ensure even loading of the gel 

10 to be used for Northern blotting, RNA concentrations were 

further adjusted after recording fluorescence under UV light 
of RNA samples stained with ethidium bromide and run on a test 
denaturing gel. 

Total RNA prepared as described above from leaves and 

15 developing seeds was electrophoresed through an agarose gel 

containing formaldehyde (Iba et al . , 1993). An equal quantity 
(10 pq) of RNA was loaded in both lanes, and RNA standards 
(0.16-1.77 kb ladder, Gibco-BRL) were loaded in a third lane,. 
Following electrophoresis, RNA was transferred from the gel to 

2 0 a nylon membrane (Hybond N+, Amersham) and fixed to the filter 

by exposure to UV light. A 32 P-labelled probe was prepared 
from insert DNA of clone pLesq2 by random priming and 
hybridized to the membrane overnight at 52°C, after it had be n 
prehybridized for 2 h. The prehybridization solution 

25 contained 5X SSC, 10X Denhardt*s solution, 0.1% SDS, 0.1M KPO A 

pH 6.8, 100 /^g/ml salmon sperm DNA. The hybridization 
solution had the same basic composition, but no SDS, and it 
contained 10% dextran sulfate and 30% formamide. The blot was 
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washed once in 2X SSC, 0.5% SDS at 65°C then in IX SSC at the 
same temperature. 

Brief (30 min) exposure of the blot to X-ray film 
revealed that the probe pLesq2 hybridized to a single band 
5 only in the seed RNA lane (Figure 7) . The blot was re-probed 

with the insert from pLesq3 gene, which gave bands of similar 
intensity in the seed and leaf lanes (Figure 7) . 

These results show that the gene corresponding to the 
clone pLesq2 is highly and specifically expressed in seed of 
10 X. fendleri . In conjunction with knowledge of the nucleotide 

and deduced amino acid sequence, strong seed-specific 
expression of the gene corresponding to the insert in pLesq2 
is a convincing indicator of the role of the enzyme in 
synthesis of hydroxylated fatty acids in the seed oil. 

15 

Characterization of a genomic clone of the gamma hydroxylase 

Genomic DNA was prepared from young leaves of L. fendl ri 
as described by Murray and Thompson (1980). A SauJAI-partial 
digest genomic library constructed in the vector ADashll 

20 (Stratagene, 11011 North Torrey Pines Road, La Jolla CA 92037) 

was prepared by partially digesting 500 ^g of DNA, size- 
selecting the DNA on a sucrose gradient (Sambrook et al., 
1989), and ligating the DNA (12 k.b average size) to the BamHT- 
digested arms of ADashll. The entire ligation was packaged 

25 according to the manufacturer's conditions and plated on E. 

coli strain XLl-Blue MRA-P2 (Stratagene) . This yielded 5xl0 5 
primary recombinant clones. The library was then amplified 
according to the manufacturer's conditions. A fraction of the 
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g n mic library was plated on E. coli XLl-Blue and resulting 
plagues (150,000) were lifted to charged nylon membranes 
(Hybond N+, Amersham) , according to the manufacturer's 
conditions. DNA was crosslinked to the filters under UV in a 
5 Stratalinker (Stratagene) . 

Several clones carrying genomic sequences corresponding 
to the L . fendleri hydroxylase were isolated by probing the 
membranes with the insert from pLesq2 that was PCR-amplif ied 
with internal primers and labelled with 32 P by random priming. 

10 The filters were prehybridized for 2 hours at 65°C in 7% SDS, 

ImM EDTA , 0.25 M Na 2 HP0 4 (pH 7.2), 1% BSA and hybridized to 
the probe for 16 hours in the same solution. The filters were 
sequentially washed at 65°C in solutions containing 2 X SSC, 1 
X SSC , 0.5 X SSC in addition to 0.1 % SDS. A 2.6 kb Xba I 

15 fragment containing the complete coding sequence for the 

gamma-hydroxylase and approximately 1 kb of the 5' upstream 
region was subcloned into the corresponding site of 
pBluescript KS to produce plasmid pLesq-Hyd and the sequ ncs 
determined completely using an automatic sequencer by the 

20 dideoxy chain termination method. Sequence data was analyzed 

using the program DNASIS (Hitachi company) . 

The sequence of the insert in clone pLesq-Hyd is shown in 
Figures 8A-B. The sequence entails 1855 bp of contiguous DNA 
sequence (SEQ ID NO:3). The clone encodes a 401 bp 5 1 

25 untranslated region (i.e., nucleotides preceding the first ATG 

codon) , an 1152 bp open reading frame, and a 302 bp 3* 
untranslated region. The open reading frame encodes a 3 84 
amino acid protein with a predicted molecular weight of 44,370 
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(SEQ ID NO:4). The amino terminus lacks features of a typical 
signal peptide (von Heijne, 1985). 

The exact translation-initiation methionine has not been 
experimentally determined, but on the basis of deduced amino 
acid sequence homology to the Castor kappa hydroxylase (n ted 
below) is thought to be the methionine encoded by the first 
ATG codon at nucleotide 4 02. 

Comparison of the pLesq-Hyd deduced amino acid sequence 
with sequences of membrane-bound desaturases and the castor 
hydroxylase (Figures 9A-B) indicates that pLesq-Hyd is 
homologous to these genes. This figure shows an alignment of 
the L . fendlerl hydroxylase (SEQ ID NO: 4) with the castor 
hydroxylase (van de Loo et al. 1995), the Arabidopsis fad2 
cDNA which encodes an endoplasmic reticulum-localized A12 
desaturase (called fad2) (Okuley et al. , 1994), two soybean 
fad2 desaturase clones, a Brassica napus fad2 clone, a Z a 
mays fad2 clone and partial sequence of a J?, communis fad2 
clone. The high degree of sequence homology indicates that 
the gene products are of similar function. For instance, the 
overall homology between the Lesquerella hydroxylase and the 
Arabidopsis fad2 desaturase was 92.2% similarity and 84.8% 
identity and the two sequences differed in length by only one 
amino acid. 

Southern hybrid ization 

Southern analysis was used to examine the copy number of 
the genes in the L. fendleri genome corresponding to the clone 
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pLesq-Hyd. G nomic DNA (5 ^g) was digested with EcoR I, Hind 
III and Xba I and separated on a 0.9% agarose gel. DNA was 
alkali-blotted to a charged nylon membrane (Hybond N+, 
Amersham) , according to the manufacturer's protocol. The blot 
was prehybridized for 2 hours at 65°C in 7% SDS, lmM EDTA, 0.25 
M NajHPO^ (pH 7.2), 1% BSA and hybridized to the probe for 16 
h urs in the same solution with pLesq-Hyd insert PCR-amplif ied 
with internal primers and labelled with 32 P by random priming. 
The filters were sequentially washed at 65°C in solutions 
containing 2 X SSC, 1 X SSC, 0.5 X SSC in addition to 0.1 % 
SDS, then exposed to X-ray film. 

The probe hybridized with a single band in each digest of 
L . fendleri DNA (Figure 10) , indicating that the gene from 
which pLesq-Hyd was transcribed is present in a single copy in 
the L . fendleri genome. 

Expression of p Lesa-Hvd in Transgenic Plants 

There are a wide variety of plant promoter sequences 
which may be used to cause tissue-specific expression of 
cloned genes in transgenic plants. For instance, the napin 
promoter and the acyl carrier protein promoters have 
previously been used in the modification of seed oil 
composition by expression of an antisense form of a desaturase 
(Knutson et al. 1992). Similarly, the promoter for the 0- 
subunit of soybean /3-conglycinin has been shown to be highly 
active and to result in tissue-specific expression in 
transgenic plants of species other than soybean (Bray et al. , 
1987) . Thus, although we describe the use of the X. fendleri 
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kappa hydroxylase promoter in the examples described here, 
other promoters which lead to seed-specific expression may 
also be employed for the production of modified seed oil 
composition. Such modifications of the invention described 
here will be obvious to one skilled in the art. 

Constructs for expression of L. fendleri kappa 
hydroxylase in plant cells are prepared as follows: A 13 kb 
Sail fragment containing the pLesq-Hyg gene was ligated into 
the Xhol site of binary Ti plasmid vector pSLJ4402 6 (Jones t 
al., 1992) (Figure 11) to produce plasmid pTi-Hyd and 
transformed into Agrobacterlum tumefaciens strains GV3101 by 
electroporation. Strain GV3101 (Konc2 and Schell, 1986) 
contains a disarmed Ti plasmid. Cells for electroporation 
were prepared as follows. GV3101 was grown in LB medium with 
reduced NaCl (5 g/1) . A 250 ml culture was grown to OD^q = 
0.6, then centrifuged at 4000 rpm (Sorvall GS-A rotor) for 15 
min. The supernatant was aspirated immediately from the loose 
pellet, which was gently resuspended in 500 ml ice-cold water. 
The cells were centrifuged as before, resuspended in 30 ml 
ice-cold water, transferred to a 30 ml tube and centrifuged at 
5000 rpm (Sorvall SS-34 rotor) for 5 min. This was repeated 
three times, resuspending the cells consecutively in 30 ml 
ice-cold water, 3 0 ml ice-cold 10% glycerol, and finally in 
0.7 5 ml ice-cold 10% glycerol. These cells were aliguoted, 
frozen in liquid nitrogen, and stored at -80C. 
Electroporations employed a Biorad Gene pulsar instrument 
using cold 2 mm-gap cuvettes containing 4 0 m± cells and 1 
of DNA in water, at a voltage of 2.5 KV, and 200 Ohms 
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resistance. The electroporated cells were diluted with 1 ml 
SOC medium (Sambrook et al., 1989, page A2) and incubated at 
28 °C for 2-4 h before plating on medium containing kanaroycin 
(50 mg/l> . 

5 Arabidopsis thaliana can be transformed with the 

Agrobacterium cells containing pTi-Hyd as described in Example 
1 above. Similarly, the presence of hydroxylated fatty acids 
in the transgeneic Arabidopsis plants can be demonstrated by 
the methods described in Example 1 above. 

0 

Constitutive expression of the L. fendleri hydroxylase in 
transgenic plants 

A 1.5 kb EcoR I fragment from pLesq-Hyg comprising the 
entire coding region of the hydroxylase was gel purified, then 

5 cloned into the corresponding site of pBluescript KS 

(Stratagene) . Plasmid DNA from a number of recombinant clones 
was then restricted with Pst I, which should cut only once in 
the insert and once in the vector poly linker sequence. 
Release of a 920 bp fragment with Pst I indicated the right 

} orientation of the insert for further manipulations. DNA from 

one such clone was further restricted with Sail, the 5' 
overhangs filled-in with the Klenow fragment of DNA polymerase 
I, then cut with Sac I. The insert fragment was gel purified, 
and cloned between the Sma I and Sac I sites of pBH2l 

5 (Clontech) behind the Cauliflower Mosaic Virus 35S promoter. 

After checking that the sequence of the junction between 
insert and vector DNA was appropriate, plasmid DNA from a 
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recombinant clone was used to transform A. tumefaclens 
(GV3101) . Kanamycin resistant colonies were then used for in 
planta transformation of A. thaliana as previously described. 
DNA was extracted from kanamycin resistant seedlings and 
5 used to PCR-aroplify selected fragments from the hydroxylase 
using nested primers. When fragments of the expected siz 
could be amplified, corresponding plants were grown in the 
greenhouse or on agar plates, and fatty acids extracted from 
fully expanded leaves, roots and dry seeds. GC-MS analysis 
10 was then performed as previously described to characterize the 
different fatty acid species and detect accumulation of 
hydroxy fatty acids in transgenic tissues. 



EXAMPLE 3 - OBTAINING OTHER PLANT FATTY ACYL HYDROXYLASES 
15 In a previous patent application, we described the ways 

in which the castor fahl2 sequence could be used to identify 
other kappa hydroxylases by methods such as PCR and 
heterologous hybridization. However, because of the high 
degree of sequence similarity .between A12 desaturases and 
20 kappa hydroxylases, prior art does not teach how to 

distinguish between the two kinds of enzymes without a 
functional test such as demonstrating activity in transgenic 
plants or another suitable host (e.g., transgenic microbial or 
animal cells). The identification of the L . fendleri 
25 hydroxylase provided for the development of criteria by which 
a hydroxylase and a desaturase may be distinguished solely on 
the basis of deduced amino acid sequence information. 

Figures 9A-B show a sequence alignment of the castor and 
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Is, fendleri hydroxylase sequences with the castor hydroxylase 
sequence and all publically available sequences for all plant 
microsomal Al2 fatty acid desaturases. Of the 384 amino acid 
residues in the castor hydroxylase sequence, more than 95% are 
identical to the corresponding residue in at least one of the 
desaturase sequences. Therefore, none of these residues are 
responsible for the catalytic differences between the 
hydroxylase and the desaturases. Of the remaining 16 residues 
in the castor hydroxylase and 14 residues in the Lesquerella 
hydroxylase, all but six represent instances where the 
hydroxylase sequence has a conservative substitution compared 
with one or more of the desaturase sequences, or there is wide 
variability in the amino acid at that position in the various 
desaturases. By conservative, we mean that the following 
amino acids are functionally equivalent : Ser/Thr, lie/ Leu/ 
Val/ Met, Asp/ Glu. Thus, these structural differences also 
cannot account for the catalytic differences between the 
desaturases and hydroxylases. This leaves just six amino acid 
residues where both the castor hydroxylase and the Lesquerella 
hydroxylase differ from all of the known desaturases and where 
all of the known microsomal 412 desaturases have the identical 
amino acid residue. These residues occur at positions 69, 
111, 155, 226, 304 and 331 of the alignment in Figure 9. 
Therefore, these six sites distinguish hydroxylases from 
desaturases. Based on this analysis, we claim that any enzyme 
with greater than 60% sequence identity to one of the enzymes 
listed in Figure 9 can be classified as a hydroxylase if it 
differs from the sequence of the desaturases at these six 
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positions. B caus of slight differences in th number of 
residues in a particular protein, the numbering may vary from 
protein to protein but the intent of the number system will be 
evident if the protein in question is aligned with the castor 
5 hydroxylase using the numbering system shown herein. Thus, in 
conjunction with the methods for using the lesquerella 
hydroxylase gene to isolate homologous genes, the structural 
criterion disclosed here teaches how to isolate and identify 
plant kappa hydroxylase genes for the purpose of genetically 
10 modifying fatty acid composition , 

In considering which of the six substitutions are solely 
or primarily responsible for the difference in catalytic 
activity of the hydroxylases of this invention and the 
desaturases, we consider it likely that the substitution of a 
Phe for a Tyr at position 22 6 may be solely responsible for 
this difference in catalytic activity because of the known 
participation of tyrosine radicals in enzyme catalysis. Other 
substitutions, such as the Ala for Ser at position 3 31 may 
have effects at modulating the overall rate of the reaction. 
On this basis we envision creating novel kappa hydroxylases by 
site directed mutagenesis of A 12 desaturases. We also 
envision converting A 15 desaturases and A 9 desaturases to 
hydroxylases by similar use of site-directed mutagenesis. 

CONCLUDING REMARKS 

By the above examples, demonstration of critical factors 
in the production of novel hydroxylated fatty acids by 
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expression of a kappa hydroxylase gene from Castor in 
transgenic plants is described. In addition, a complete cDNA 
sequence of the Lesguerella fendleri kappa hydroxylase is also 
provided. A full sequence of the castor hydroxylase is also 
given with various constructs for use in host cells. Through 
this invention, one can obtain the amino acid and nucleic acid 
sequences which encode plant fatty acyl hydroxylases from a 
variety of sources and for a variety of applications. 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes of 
clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the scope of 
the appended claims. 



- 60- 

BNSOOCID: tWO 96100 75A1_I_> 



WO 96/10075 



PCTAJS95/11855 



REFERENCES : 

Beltz, G.A. , Jacobs, K.A. , Eickbuch, T.H., Cherbas, P.T., 
5 Kafatos, F.C. (1983) Isolation of multigene families and 

determination of homologies by filter hybridization methods. 
Methods in Enzymology 100, 266-285. 

Bray, E.A. , Naito, S. , Pan, N.S., Anderson, E. , Dube, P., 
10 Beachy, R.N. (1987) Expression of the 0-subunit of 0- 

conglycinin in seeds of transgenic plants. Planta 172:364-370. 

Carlson, K.D., Chaudhry, A- , Bagby, M.O (1990) Analysis of 
oil and meal from lesguerella fendleri seed. J. Am. Oil Chem. 
15 Soc. 67, 438-442. 

Ditta, G., Sfanfield, S., Corbin, D. , Helinski, D.R. (1980) 
Broad host range DNA cloning system for gram-negative 
bacteria: Construction of a gene bank of Rhizoblum meliloti. 
20 Proc. Natl. Acad. Sci. USA 77,7347-7 3 51. 

Gould, S.J. , Subramani, S. , Scheffler, I.E. (1989) Use of the 
DNA polymerase chain reaction for homology probing. Proc. 
Natl. Acad. Sci. USA 86, 1934-1938. 

25 

Hirsinger, F. (198 9) New oil crops, in Oil Crops of the World, 
Robbelen, G . , Downey, K.R., and Ashri, A., Eds., McGraw-Hill, 
New York, pp. 518-533. 

- 61 - 



BNSOOCID: <WO 9610075A1 J_> 



WO 96/10075 



PCTAJS95/11855 



Howling, D., Morris, L.J., Gurr, M.I., James, A.T. (1972) The 
specificity of fatty acid desaturases and hydroxylases. The 
dehydrogenation and hydroxylation of monoenoic acids, Biochim. 
Biophys. Acta 260, 10. 

5 

Huyuh, T.V. , Young, R. A. , Davis, R.W, (1985) Constructing and 
screening cDNA libraries in AgtlO and igtll. In DNA Cloning, 
Vol. 1: A Practical Approach, (ed) D.M. Glover. IRL Press, 
Washington DC pp 4 9-77. 

0 

Iba, K. , Gibson, S., Nishiuchi, T., Fuse, T. , Nishimura, M. , 
Arondel, V., Hugly, S., and Somerville, C. (1993) A gene 
encoding a chloroplast omega-3 fatty acid desaturase 
complements alterations in fatty acid desaturation and 
5 chloroplast copy number of the fad7 mutant of Arabidopsis 

thaliana. J. Biol. Chem. 268, 24099-24105. 

Jones, J.D.G., Shlumukov, L. , Carland, F. , English, J., 
Scofield, S., Bishop, G.J., Harrison, K. (1992) Effective 
) vectors for transformation, expression of heterologous genes, 
and assaying transposon excision in transgenic plants. 
Transgenic Res. 1,285-297. 

Knutson, D.S., Thompson, G.A., RadXe, S.E., Johnson, W.B., 
i Knauf, V.c, Kridl, J.C. (1992) Proc. Natl. Acad. Sci. USA 89, 
2624-2628. 

Koncz, C. , Schell, J. (1986) The promoter of T L -DNA gene 5 

- 62 - 



BNSOOCIO: <WO 9610Q7SAtJ_> 



WO 96/10075 PCI7US95/1 1855 

* 

controls the tissu -specific expression of chimeric genes 
carried by a novel type of Agrobacterium binary vector. Mol. 
Gen. Genet. 204, 383-396. 

Miquel, M. Browse, J. (1992) Arabidopsis mutants deficient in 
polyunsaturated fatty acid synthesis. J. Biol. Chem. 267, 
1502-1509. 

Murray, M.G., Thompson, W.F. (1980) Rapid isolation of high 
molecular weight plant DNA. Nucl. Acid Res. 8,4321-4325. 

Okuley, J., Lightner, J., Feldman, K. , Yadav, N . , Lark, E. , 
Browse, J. (1994) Arabidopsis FAD 2 gene encodes the enzyro 
that is essential for polyunsaturated lipid 

Sambrook, J., Fritsch, E.F., and Maniatis, T. , Molecular 
Cloning: a Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory Press, 1989. 

Smith C.R., Jr. (1985) Unusual seed oils and their fatty 
acids, in Fatty Acids, Pryde E.H., Ed., American Oil Chemists' 
Society, Champaign, Second edition, pp 29-47. 

van de Loo, F.J., Fox, B.G., Somerville, C. (1993) Unusual 
fatty acids, in Lipid Metabolism in Plants, T.S. Moore Jr., 
Ed., CRC Press, Boca Raton, pp9 1-126. 

- 63 - 



9610075A1 I 



WO 96/10075 



PCT/US95/11855 



van de Loo, F.N. , Turner, S. , Somerville, C.R, (1995) An 
oleate 12 -hydroxylase from castor (Ricinus communis I>. ) is a 
fatty acyl deSaturase homolog. Proc. Natl. Acad. Sci. USA 
92,6743-6747 

v n Heijne, G. (1985) Signal sequences. J. Mol. Biol. 184,99- 
105 

Atsmon, D. (1989) Castor, in Oil Crops of the World, Robbelen, 
G. , Downey, K.R. , and Ashri, A., Eds. , . McGraw-Hill , New York, 
pp. 438-447. 

Bechtold, N. , Ellis, J. and Pelletier, G. (1993) In Planta 
Agrobacterium mediated gene transfer by infiltration of adult 

Arabidopsis thaliana plants. C.R. Acad. Sci. Paris 316, 
1194-1199 . 



: <WO 9610075A1 J_> 



- 64 - 



WO 96/10075 



PCT/US95/I1855 



SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION : 



(i) APPLICANT: Somerville, Chris 
Broun, Pierre 



(ii) TITLE OF INVENTION: PRODUCTION OF HYDROXYLATED FATTY ACIDS 
IN GENETICALLY MODIFIED PLANTS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : CUSHMAN DARBY & CUSHMAN, LLP 

(B) STREET: 1100 NEW YORK AVENUE, NW 

(C) CITY: WASHINGTON 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3918 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version //1.25 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/11855 

(B) FILING DATE: 25-SEP-1995 

(C) CLASSIFICATION: 

{viiij ATTORNEY /AGENT INFORMATION: 
{A) NAME: Kokulis, Paul N . 

(B) REGISTRATION NUMBER: 16,773 

(C) REFERENCE/DOCKET NUMBER: 1220/213781 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-861-3000 

(B) TELEFAX: 202-822-0944 

(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 



(A) 
(B) 
(C) 
(D) 



LENGTH: 
TYPE: 

STRANDEDNESS: 
TOPOLOGY: 



543 nucleotides 

nucleotide 

single 

linear 



(ii) 



MOLECULE TYPE: 



CDNA 



Ui) 



SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TA7TGGCACC GGCGGCACCA TTCCAACAAT GGATCCCTAG AAAAAGATGA AGTCTTTGTC 



60 



CCACCTAAGA AAGCTGCAGT CANATGCTAT GTCAAATACC TCAACAACCC TCTTGGACGC 



120 
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ATTCTGGTGT TAACAGTTCA GTTTATCCTC GGGTGGCCTT TGTATCTAGC CTTTAATGTA 180 

TCAGGTAGAC CTTATGATGG TTTCGCTTCA CATTTCTTCC CTCATGCACC TATCTTTAAG 240 

GACCGTGAAC GTCTCCAGAT ATACATCTCA GATGCTGGTA TTCTAGCTGT CTGTTATGGT 300 

CTTTACCGTT ACGCTGCTTC ACAAGGATTG ACTGCTATGA TCTGCGTCTA CGGAGTACCG 360 

CTTTTGATAG TGAACTTTTT CCTTGTCTTG GTCACTTTCT TGCAGCACAC TCATCCTTCA A 20 

TTACCTCACT ATGATTCAAC CGAGTGGGAA TGGATTAGAG GAGCTTTGGT IACGGTAGAC 480 

AGAGACTATG GAATCTTGAA CAAGGTGTTT CACAACATAA CAGACACCCA CGTAGCACAC 540 

C AC 543 

(2J INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 nucleotides 

(BJ TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TATAGGCACC GGAGGCACCA TTCCAACACA GGATCCCTCG AAAGAGATGA AGTATTTGTC 60 

CCAAAGCAGA AATCCGCAAT CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 120 

ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT TGTACTTAGC CTTCAACGTT 180 

TCTGGCAGAC CCTACAATGG TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 240 

GACCGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA TTCTAGCCGT CTGTTATGGT 300 

CTTTACCGTT ACGCTGTTGC ACAAGGACTA GCCTCAATGA TCTGTCTAAA CGGAGTTCCG 3 6C 

CTTCTGATAG TTAACTTTTT CCTCGTCTTG ATCACTTACT TACAACACAC TCACCCTGCG 4^0 

TTGCCTCACT ATGATTCATC AGAGTGGGAT TGGCTTAGAG GAGCTTTAGC TACTGTAGAC 480 

AGAGACTATG GAATCTTGAA CAAGGTGTTC CATAACATCA CAGACACCCA CGTCGCACAC 540 

CACT 544 

(2) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1740 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 
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ATGAAGCTTT ATAAGAACTT AGTTTTCTCT GGTGACAGAG AAATTNTGTC AATTGGTAGT 60 

GACAGTTGAA GCAACAGGAA CAACAAGGAT GGTTGGTGKT GATGCTGATG TGGTGATGTG 120 

TTATTCATCA AATACTAAAT ACTACATTAC TTGTTGCTGC CTACTTCTCC TATTTCCTCC 180 

GCCACCCATT TTGGACCCAC GANCCTTCCA TTTAAACCCT CTCTCGTGCT ATTCACCAGA 240 

AGAGAAGCCA AGAGAGAGAG AGAGAGAATG TTCTGAGGAT CATTGTCTTC TTCATCGTTA 300 

TTAACGTAAG TTTTTTTTGA CCACTCATAT CTAAAATCTA GTACATGCAA TAGATTAATG 360 

ACTGTTCCTT CTTTTGATAT TTTCAGCTTC TTGAATTCAA GATGGGTGCT GGTGGAAGAA A 20 

TAATGGTTAC CCCCTCTTCC AAGAAATCAG AAACTGAAGC CCTAAAACGT GGACCATGTG A 80 

AGAAACCACC ATTCACTGTT AAAGATCTGA AGAAAGCAAT CCCACAGCAT TGTTTCAAGC 5 AO 

GCTCTATCCC TCGTTCTTTC TCCTACCTTC TCACAGATAT CACTTTAGTT TCTTGCTTCT. 600 

ACTACGTTGC CACAAATTAC TTCTCTCTTC TTCCTCAGCC TCTCTCTACT TACCTAGCTT 660 

GGCCTCTCTA TTGGGTATGT CAAGGCTGTG TCTTAACCGG TATCTGGGTC ATTGGCCATG 720 

AATGTGGTCA CCATGCATTC AGTGACTATC AATGGGTAGA TGACACTGTT GGTTTTATCT 780 

TCCATTCCTT CCTTCTCGTC CCTTACTTCT CCTGGAAATA CAGTCATCGT CGTCACCATT 8A0 

CCAACAATGG ATCTCTCGAG AAAGATGAAG TCTTTGTCCC ACCGAAGAAA GCTGCAGTCA 900 

AATGGTATGT TAAATACCTC AACAACCCTC TTGGACGCAT TCTGGTGTTA ACAGTTCAGT 960 

TTATCCTCGG GTGGCCTTTG TATCTAGCCT TTAATGTATC AGGTAGACCT TATGATGGTT 1020 

TCGCTTCACA TTTCTTCCCT CATGCACCTA TCTTTAAAGA CCGAGAACGC CTCCAGATAT 1080 

ACATCTCAGA TGCTGGTATT CTAGCTGTCT GTTATGGTCT TTACCGTTAC GCTGCTTCAC 11 AO 

AAGGATTGAC TGCTATGATC TGCGTCTATG GAGTACCGCT TTTGATAGTG AACTTTTTCC 1200 

TTGTCTTGGT AACTTTCTTG CACCACACTC ATCCTTCGTT ACCTCATTAT GATTCAACCG 1260 

AGTGGGAATG GATTAGAGGA GCTTTGGTTA CGGTAGACAG AGACTATGGA ATATTGAACA 1320 

AGGTGTTCCA TAACATAACA GACACACATG TGGCTCATCA TCTCTTTGCA ACTATACCGC 1380 

ATTATAACGC AATGGAAGCT ACAGAGGCGA TAAAGCCAAT ACTTGGTGAT TACTACCACT 1AA0 

TCGATGGAAC ACCGTGGTAT GTGGCCATGT ATAGGGAAGC AAAGGAGTGT CTCTATGTAG 1500 

AACCGGATAC GGAACGTGGG AAGAAAGGTG TCTACTATTA CAACAATAAG TTATGAGGCT 1560 

GATAGGGCGA GAGAAGTGCA ATTATCAATC TT-CATTTCCA TGTTTTAGGT GTCTTGTTTA 1620 

AGAAGCTATG CTTTGTTTCA ATAATCTCAG AGTCCATNTA GTTGTGTTCT GGTGCATTTT 1680 

GCCTAGTTAT GTGGTGTCGG AAGTTAGTGT TCAAACTGCT TCCTGCTGTG CTGCCCAGTG 17AO 
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AAGAACAAGT TTACGTGTTT AAAATACTCG GAACGAATTG ACCACAANAT ATCCAAAACC 1800 
GGCTATCCGA ATTCCATATC CGAAAACCGG ATATCCAAAT TTCCAGAGTA CTTAG 1835 

(2) INFORMATION FOR SEQ ID NO: 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Gly Gly Arg He Met Val Thr Pro Ser Ser Lys Lys Ser 
15 10 15 

Glu Thr Glu Ala Leu Lys Arg Gly Pro Cys Glu Lys Pro Pro Phe Thr 
20 25 30 

Val Lys Asp Leu Lys Lys Ala He Pro Gin His Cys Phe Lys Arg Ser 
35 40 45 

He Pro Arg Ser Phe Ser Tyr Leu Leu Thr Asp He Thr Leu Val Ser 
50 55 60 

Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 
6o 70 75 80 

Leu Ser Thr Tvr Leu Ala Trp Pro Leu Tyr Trp Val Cys Gin Gly Cys 
85 90 95 

Val Leu Thr Gly He Trp Val He Gly His Glu Cys Gly His His Ala 
100 105 110 

Phe Ser Asp Tvr Gin Trp Val Asp Asp Thr Val Gly Phe He Phe His 
115 120 125 

Ser Phe Leu Leu Val Pro Tyr Phe Ser Trp Lvs Tyr Ser His Arg Arg 
130 135 140 

His Kis Ser Asn Asn Gly Ser Leu Glu Lys Asp Glu Val Phe Val Pro 
145 150 155 160 

Pro Lvs Lys Ala Ala Val Lys Trp Tyr Val Lys Tyr Leu Asn Asn Pro 
165 170 175 

Leu GIv Arg He Leu Val Leu Thr Val Gin Phe He Leu Gly Trp Pro 
180 185 190 

Leu Tvr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala 
195 200 205 

Ser His Phe Phe Pro His Ala Pro He Phe Lys Asp Arg Glu Arg Leu 
210 215 220 
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Gin lie Tyr He Ser Asp Ala Gly He Leu Ala Val Cys Tyr Gly Leu 
225 230 235 240 

Tyr Arg Tyr Ala Ala Ser Gin Gly Leu Thr Ala Met He Cys Val Tyr 
245 250 255 

Gly Val Pro Leu Leu He Val Asn Phe Phe Leu Val Leu Val Thr Phe 
260 265 270 

Leu Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Thr Glu Trp 
275 280 285 

Glu Trp He Arg Gly Ala Leu Val Thr Val Asp Arg Asp Tyr Gly He 
290 295 300 

Leu Asn Lys Val Phe His Asn He Thr Asp Thr His Val Ala His His 
305 310 ■ 315 320 

Leu Phe Ala Thr He Pro His Tyr Asn Ala Met Glu Ala Thr Glu Ala. 

325 330 335 

He Lys Pro He Leu Gly Asp Tyr Tyr His Phe Asp Gly Thr Pro Trp 
340 3A5 350 

Tyr Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Leu Tyr Val Glu Pro 
355 360 365 

Asp Thr Glu Arg Gly Lys Lys Gly Val Tyr Tyr Tyr Asn Asn Lys Leu 
370 375 380 



(2) IKFORKATI0N FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 387 amino acids 
fB) TYPE: amino acid 
{ D ) TOPOLOGY: linear 



(xi! SEQUENCE DESCRIPTION* : SEQ ID NO: 5: 

Met Gly Gly Gly Gly Arg Met Ser Thr Val He Thr Ser Asn Asn Ser 
15 10 15 

Glu Lys Lys Gly Gly Ser Ser His Leu Lys Arg Ala Pro His Thr Lys 
20 25 30 

Pro Pro Phe Thr Leu Gly Asp Leu Lys Arg Ala lie Pro Pro His Cys 
35 40 45 

Phe Glu Arg Ser Phe Val Arg Ser Phe Ser Tyr Val Ala Tyr Asp Val 
50 55 60 

Cys Leu Ser Phe Leu Phe Tyr Ser lie Ala Thr Asn Phe Phe Pro Tyr 
65 70 75 80 
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lie Ser Ser Pro Leu Ser Tyr Val Ala Trp Leu Val Tyr Trp Leu Phe 
85 90 95 

Gin Gly Cys He Leu Thr Gly Leu Trp Val He Gly His Glu Cys Gly 
100 105 110 

His His Ala Phe Ser Glu Tyr Gin Leu Ala Asp Asp He Val Gly Leu 
115 120 125 

He Val His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser 
130 135 HO 

His Arg Arg His His Ser Asn He Gly Ser Leu Glu Arg Asp Glu Val 
145 150 155 160 

Phe Val Pro Lys Ser Lys Ser Lys He Ser Trp Tyr Ser Lys Tyr Ser 
165 170 175 

Asn Asn Pro Pro Gly Arg Val Leu Thr Leu Ala Ala Thr Leu Leu Leu 
180 185 190 

Gly Trp Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp 
195 200 205 

Arg Phe Ala Cys His Tyr Asp Pro Tyr Gly Pro He Phe Ser Glu Arg 
210 215 220 

Glu Arg Leu Gin He Tyr He Ala Asp Leu Gly He Phe Ala Thr Thr 
225 230 235 240 

Phe Val Leu Tvr Gin Ala Thr Met Ala Lys Gly Leu Ala Trp Val Met 
245 250 255 

Arg He Tvr Gly Val Pro Leu Leu lie Val Asn Cys Phe Leu Val Met 
260 265 270 

lie Thr Tvr Leu Gin Kis Thr His Pro Ala He Pro Arg Tyr Gly Ser 
275 280 285 

Ser Glu Trp Asp Trp Leu Arg Gly Ala Met Val Thr Val Asp Arg Asp 
290 295 300 

Tyr Gly Val Leu Asn Lys Val Phe His Asn He Ala Asp Thr His Val 
305 310 315 320 

Ala His His Leu Phe Ala Thr Val Pro His Tyr His Ala Met Glu Ala 
325 330 335 

Thr Lys Ala He Lys Pro lie Met Gly Glu Tyr Tyr Arg Tyr Asp Gly 
340 345 350 

Thr Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Lys Glu Cys Leu Phe 
355 360 365 

Val Glu Pro Asp Glu Gly Ala Pro Thr Gin Gly Val Phe Trp Tyr Arg 
370 375 380 
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Asn Lys Tyr 
385 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION*: SEQ ID N0:6: 

Met Gly Ala Gly Gly Arg Met Pro Val Pro Thr Ser Ser Lys Lys Ser 
15 10 15 

Glu Thr Asp Thr Thr Lys Arg Val Pro Cys Glu Lys Pro Pro Phe Ser 
20 25 30 

Val Gly Asp Leu Lys Lys Ala He Pro Pro His Cys Phe Lys Arg Ser 
35 40 45 

He Pro Arg Ser Phe Ser Tyr Leu He Ser Asp He He lie Ala Ser 
50 55 60 

Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 
65 70 75 80 

Leu Ser Tyr Leu Ala Trp Pro Leu Tyr Trp Ala Cys Gin Gly Cys Val 
85 90 95 

Leu Thr Gly He Trp Val He Ala His Glu Cys Gly His His Ala Phe 
100 105 110 

Ser Asp Tyr Gin Trp Leu Asp Asp Thr Val Gly Leu He Phe His Ser 
115 120 125 

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 
130 135 HO 

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 
145 150 155 160 

Gin Lys Ser Ala He Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu 
165 170 175 

Gly Arg He Met Met Leu Thr Val Gin Phe Val Leu Gly Trp Pro Leu 
180 185 190 

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala Cys 
195 200 205 

His Phe Phe Pro Asn Ala Pro He Tyr Asn Asp Arg Glu Arg Leu Gin 
210 215 220 

He Tyr Leu Ser Asp Ala Gly lie Leu Ala Val Cys Phe Gly Leu Tyr 
225 230 235 240 
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Arg Tyr Ala Ala Ala Gin Gly Met Aia Ser Met lie Cys Leu Tvr Gly 
245 250 255 

Val Pro Leu Leu lie Val Asn Ala Phe Leu Val Leu He Thr Tyr Leu 
260 265 270 

Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp 
275 280 285 

-Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly lie Leu 
290 295 300 

Asn Lys Val Phe His Asn He Thr Asp Thr His Val Ala His His Leu 
305 310 315 320 

Phe Ser Thr Ket Pro His Tyr Asn Ala Met Glu Ala Thr Lys Ala He 
325 330 335 

Lys Pro He Leu Gly Asp Tyr Tyr Gin Phe Asp Gly Thr Pro Trp Tyr 
340 345 350 

Val Ala Met Tyr Arg Glu Aia Lys Glu Cys He Tyr Val Glu Pro Asp 
355 360 ' 365 

Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu 
370 375 380 

(2) INFORMATION" FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Gly Ala Gly Gly Arg Met Gin Val Ser Pro Pro Ser Lys Lys Ser 
15 10 15 

Glu Thr Asp Asn He Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr 
20 25 30 

Val Gly Glu Leu Lys Lys Ala He Pro Pro His Cys Phe Lys Arg Ser 
35 40 45 

He Pro Arg Ser Phe Ser His Leu He Trp Asp He He He Ala Ser 
50 55 60 

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro Asn Pro 
65 70 75 80 

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gin Gly Cys Val 
85 90 95 

Leu Thr Gly Val Trp Val He Ala His Glu Cys Gly His Ala Ala Phe 
100 105 110 
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Ser Asp Tyr Gin Trp Leu Asp Asp Thr Val Gly Leu lie Phe His Ser 
115 120 125 

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 
130 135 140 

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Arg 
145 150 155 160 

Arg Ser Gin Thr Ser Ser Gly Thr Ala Ser Thr Ser Thr Thr Phe Gly 
165 170 175 

Arg Thr Val Met Leu Thr Val Gin Phe Thr Leu Gly Trp Pro Leu Tyr 
180 185 190 

Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala Cys 
195 200 205 

His Phe His Pro Asn Ala Pro He Tyr Asn Asp Arg Glu Arg Leu Gin 
210 215 220 

He Tyr He Ser Asp Ala Gly lie Leu Ala Val Cys Tyr Gly Leu Leu 
225 230 235 240 

Pro Tyr Ala Ala Val Gin Gly Val Ala Ser Met Val Cys Phe Leu Arg 
245 250 255 

Val Pro Leu Leu He Val Asn Gly Phe Leu Val Leu He Thr Tyr Leu 
260 265 270 

Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp 
275 280 285 

Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly He Leu 
290 295 300 

Asn Gin Gly Phe His Asn lie Thr Asp Thr His Glu Ala His His Leu 
305 310 315 320 

Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lvs Ala He 
325 330 335 

Lys Pro He Leu Gly Glu Tyr Tyr Gin Phe Asp Gly Thr Pro Val Val 
340 345 350 

Lvs Ala Met Trp Arg Glu Ala Lys Glu Cys He Tyr Val Glu Pro Asp 
355 360 365 

Arg Gin Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu Xaa 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 8: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xx) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ser Leu Leu Thr Ser Phe Ser Tyr Val Val Tvr Asp Leu Ser Phe Ala 
15 10 15 

Phe He Phe Tyr lie Ala Thr Thr Tyr Phe His Leu Leu Pro Gin Pro 
20 25 30 

Phe Ser Leu lie Ala Trp Pro lie Tyr Trp Val Leu Gin Gly Cvs Leu 
35 AO 45 

Leu Thr Arg Val Cys Gly His His Ala Phe Ser Lys Tyr Gin Trp Val 
50 55 60 

Asp Asp Val Val Gly Leu Thr Leu His Ser Thr Leu Leu Val Pro Tyr 
65 70 75 80 

Phe Ser Trp Lys He Ser His Arg Arg His His Ser Asn Thr Glv Ser 
85 90 95 

Leu Asp Arg Asp Glu Arg Val Lys Val Ala Trp Phe Ser Lys Tyr Leu 
100 105 110 

Asn Asn Pro Leu Gly Arg Ala Val Ser Leu Leu Val Thr Leu Thr He 
115 120 125 

Gly Trp Pro Met Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tvr Asp 
130 135 140 

Ser Phe Ala Ser His Tyr His Pro Tyr Arg Val Arg Leu Leu lie Tyr 
145 150 155 160 

Val Ser Asp Val Ala Leu Phe Ser Val Thr Tyr Ser Leu Tyr Arg Val 
165 170 175 

Ala Thr Leu Lvs Gly Leu Val Trp Leu Leu Cys Val Tyr Gly Val Pro 
180 185 190 

Leu Leu He Val Asn Gly Phe Leu Val Thr He Thr Tyr Leu Arg Val 
195 200 205 

His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Lys Gly Ala Leu Ala Thr 
210 215 220 

Ket Asp Arg Asp Tyr Gly He Leu Asn Lys Val Phe His His lie Thr 
225 230 235 240 

Asp Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tvr His 
245 250 255 

Leu Arg Val Lys Pro He Leu Gly Glu Tyr Tyr Gin Phe Asp Asp Thr 
260 265 270 

Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Arg Glu Cys Leu Tvr Val 
275 280 285 
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Glu Pro Asp Glu Gly Thr Ser Glu Lys Gly Val Tyr Trp Tvr Arg Asn 
290 295 300 

Lys Tyr Leu Arg Val 
305 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Phe Ser Tyr Val Val Tyr Asp Leu Thr lie Ala Phe Cvs Leu Tyr Tyr 
15 10 15 

Val Ala Thr His Tyr Phe His Leu Leu Pro Gly Pro Leu Ser Phe Arg 
20 25 30 

Gly Met Ala lie Tyr Trp Ala Val Gin Gly Cys lie Leu Thr Gly Val 
35 40 Ub 

Trp Val Val Ala Phe Ser Asp Tyr Gin Leu Leu Asp Asp He Val Gly 
50 55 60 

Leu lie Leu His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr 
65 70 75 80 

Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu 
85 90 95 

Val Phe Val Pro Lys Val Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg 
100 105 110 

Val Leu Thr Leu Ala Val Thr Leu Thr Leu Gly Trp Pro Leu Tyr Leu 
U5 120 125 

Ala Leu Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr 
130 135 U0 

Asp Pro Tyr Gly Pro He Tyr Ser Val He Ser Asp Ala Gly Val Leu 
U5 150 155 160 

Ala Val Val Tyr Gly Leu Phe Arg Leu Ala Met Ala Lys Gly Leu Ala 
165 170 175 

Trp Val Val Cvs Val Tyr Gly Val Pro Leu Leu Val Val Asn Gly Phe 
180 185 190 

Leu Val Leu He Thr Phe Leu Gin His Thr His Val Ser Glu Trp Asp 
195 200 205 

Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly He Leu 
210 215 220 

- 75 - 

SUBSTITUTE SHEET {RULE 26) 



BNSOOCIO: <:WO 9610O75A1 J. 



WO 96/10075 



PCIYUS9S/11855 



Asn Lys Val Phe His Asn lie Thr Asp Thr Kis Val Ala His His Leu 
225 230 235 240 

Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Val Glu Tyr 
245 250 255 

Tyr Arg Phe Asp Glu Thr Pro Phe Val Lys Ala Met Trp Arg Glu Ala 
260 265 270 

Arg Glu Cys He Tyr Val Glu Pro Asp Gin Ser Thr Glu Ser Lys Gly 
275 280 285 

Val Phe Trp Tyr Asn Asn Lys Leu Ala Met Glu Ala Thr Val 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Gly Ala Gly Gly Arg Met Thr Glu Lys Glu Arg Glu Lys Gin Glu 
15 10 15 

Gin Leu Ala Arg Ala Thr Gly Gly Ala Ala Met Gin Arg Ser Pro Val 
20 25 30 

Glu Lys Pro Pro Phe Thr Leu Gly Gin He Lys Lys Ala He Pro Pro 
35 40 45 

His Cvs Phe Giu Arg Ser Val Leu Lys Ser Phe Ser Tyr Val Val His 
50 55 60 

Asp Leu Val He Ala Ala Ala Leu Leu Tyr Phe Ala Leu Ala He He 
65 70 75 80 

Pro Ala Leu Pro Ser Pro Leu Arg Tyr Ala Ala Trp Pro Leu Tyr Trp 
85 90 95 

He Ala Gin Gly Ala Phe Ser Asp Tyr Ser Leu Leu Asp Asp Val Val 
100 105 110 

Gly Leu Val Leu His Ser Ser Leu Met Val Pro Tyr Phe Ser Trp Lys 
115 120 125 

Tyr Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp 
130 135 140 

Glu Val Phe Val Pro Lys Lys Lys Glu Ala Leu Pro Trp Tyr Thr Pro 
145 150 155 160 

Tyr Val Tyr Asn Asn Pro Val Gly Arg Val Val His He Val Val Gin 
165 170 175 
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Leu Thr Leu Gly Trp Pro Leu Tyr Leu Ala Thr Asn Ala Ser Gly Arg 
180 185 190 

Pro Tyr Pro Arg Phe Ala Cys His Phe Asp Pro Tyr Gly Pro He Tyr 
195 200 205 

Asn Asp Arg Glu Arg Ala Gin He Phe Val Ser Asp Ala Gly Val Val 
210 215 220 

Ala Val Ala Phe Gly Leu Tyr Lys Leu Ala Ala Ala Phe Gly Val Trp 
225 230 235 240 

Trp Val Val Arg Val Tyr Ala Val Pro Leu Leu Lie Val Asn Ala Trp 
245 250 255 

Leu Val Leu He Thr Tyr Leu Gin His Thr His Pro Ser Leu Pro His 
260 265 270 

Tvr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly Ala Leu Ala Thr Met 
275 280 285 

Asp Arg Asp Tyr Gly lie Leu Asn Arg Val Phe His Asn lie Thr Asp 
290 295 300 

Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His Ala 
305 310 315 320 

Met Glu Ala Thr Lys Ala He Arg Pro He Leu Gly Asp Tyr Tyr His 
325 330 335 

Phe Asp Pro Thr Pro Val Ala Lys Ala Thr Trp Arg Glu Ala Gly Glu 
340 345 350 

Cys lie Tvr Val Glu Pro Glu Asp Arg Lys Gly Val Phe Trp Tyr Asn 
355 360 365 

Lvs Lys Phe Xaa 

370 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

Trp Val Met Ala His Asp Cys Gly His His Ala Phe Ser Asp Tyr Gin 
15 10 15 

Leu Leu Asp Asp Val Val Gly Leu He Leu His Ser Cys Leu Leu Val 
20 25 30 

Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His His Ser Asn Thr 
35 40 45 
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Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys Lys Lvs Ser Ser 
50 55 60 

He Arg Trp Tyr Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg lie Met 
65 70 75 80 

Thr He Ala Val Thr Leu Ser Leu Gly Trp Pro teu Tyr Leu Ala Phe 
85 90 95 

Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr Asp Pro 
100 105 110 

Tyr Gly Pro He Tyr Asn Asp Arg Glu Arg lie Glu He Phe He Ser 
115 120 125 

Asp /.la Gly Val Leu Ala Val Thr Phe Glv Leu Tyr Gin Leu Ala He 
130 135 1A0 

Ala Lys Gly Leu Ala Trp Val Val Cys Val Tvr Gly Val Pro Leu Leu 
145 150 155 160 

Val Val Asn Ser Phe Leu Val Leu lie Thr Phe Leu Gin His Thr His 
165 170 175 

Pro Ala Leu Pro His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly 
180 185 190 

Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly He Leu Asn Lys Val Phe 
195 200 205 

His Asn lie Thr Asp Thr Gin Val Ala His His Leu Phe Thr K.et Pro 
210 215 220 

(2) IKFORKATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 



(A) 
(BJ 
tC) 
(DJ 



LENGTH: 
TYPE: 

STRANDEDKESS: 
TOPOLOGY: 



20 nucleotides 
nucleotide 
single 
linear 



Uii 



KOLECULE TYPE: 



DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GCTCTTTTGT GCGCTCATTC 



20 



INFORMATION FOR SEQ ID NO: 13 



) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) ST RAND ED NESS : 

(D) TOPOLOGY: 



20 nucleotides 
nucleotide 
single 
linear 



(ii) 



KOLECULE TYPE: 



DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGTACCAGA AAACGCCTTG 20 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:H: 

TAYVSNCAYM GNMGNCAYCA 20 

(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 nucleotides 

( B ) TYPE : nucleotide 

(C) STRANDEDNESS: single 
|D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

RTGRTGNGCN ACRTGNGTRT C 21 
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CLAIMS 

What is claimed is: 

1. An isolated nucleic acid fragment comprising a 
nucleic acid sequence encoding a fatty acid hydroxylase with 
an amino acid identity of 60% or greater to the polypeptide 
encoded by SEQ ID NO: 4. 

2. The isolated nucleic acid fragment of Claim 1, 
wherein the amino acid identity is 90% or greater to the 
polypeptide encoded by SEQ ID NO: 4. 

3. The isolated nucleic acid fragment of Claim 1, 
wherein the amino acid identity is 100% of the polypeptide 
encoded by SEQ ID NO: 4. 

4. An isolated nucleic acid fragment having a nucleic 
acid identity of 90% or greater of a nucleotide sequence of 
SEQ ID NO:l, 2, or 3. 

5. An isolated nucleic acid having a nucleotide 
sequence of SEQ ID NO:l, SEQ ID NO:2 or SEQ ID N0:3. 

6. The isolated nucleic acid fragment of Claim 1, 
wherein said fragment is isolated from an oil-producing plant 
species. 
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7. A chimeric gene capabl of causing altered levels 
of ricinoleic acid in a transformed plant cell, said chimeric 
gene comprising a nucleic acid fragment of Claim 1, said 
fragment operably linked to suitable regulatory sequences. 

8. A chimeric gene capable of causing altered levels of 
lesquerolic acid in a transformed plant cell, said chimeric 
gene comprising a nucleic acid fragment of Claim 1, said 
fragment operably linked to suitable regulatory sequences. 

9. A chimeric gene capable of causing altered levels of 
fatty acids in a transformed plant cell, said chimeric gene 
comprising a nucleic acid fragment of Claim 1, said fragment 
operably linked to suitable regulatory sequences. 

10. A chimeric gene capable of causing altered levels of 
fatty acids in a transformed plant cell, said chimeric gene 
comprising a nucleic acid fragment of Claim 2, said fragment 
operably linked to suitable regulatory sequences. 

11. A chimeric gene capable of causing altered levels of 
fatty acids in a transformed plant cell, said chimeric gene 
comprising a nucleic acid fragment of Claim 4, said fragment 
operably linked to suitable regulatory sequences. 

12. Plants containing the chimeric gene of any one of 
claims 7, 8, 9, 10 or 11. 
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13. Oil obtained from seeds of the plants of claim 12. 



14. The isolated nucleic acid fragment of Claim 1, 
wherein said fragment is obtainable from Ricinus communis (L.) 
(Castor) . 

15. The isolated nucleic acid fragment of Claim l, 
wherein said fragment is obtainable from Lesquerella fendl ri. 

16. A method of producing seed oil containing altered 
levels of hydroxylated fatty acids comprising: 

(a) transforming a plant cell of an oil-producing species 
with a chimeric gene containing an isolated nucleic acid of 
Claim 1; 

(b) growing fertile plants from the transformed plant 
cells of step (a) ; 

(c) screening progeny seeds from the fertile plants of 
step (b) for the desired levels of hydroxylated fatty acids; 
and 

(d) processing the progeny seed of step (c) to obtain 
seed oil containing altered levels of unsaturated fatty acids. 

17. The method of Claim 16, wherein said crop plant is 
selected from the group consisting of rapeseed, Crambe, 
Brasslca juncea, Canola, flax, sunflower, saf flower, cotton, 
cuphea, soybean, peanut, coconut, oil palm and corn. 
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18. A method of producing seed oil containing altered 
levels of hydroxy lated fatty acids comprising: 

(a) transforming a plant cell of an oil-producing species 
with a chimeric gene containing the nucleotide sequence of SEQ 
ID NO:l, SEQ ID NO: 2 or SEQ ID NO:3; 

(b) growing fertile plants from the transformed plant 
cells of step (a) ; 

(c) screening progeny seeds from the fertile plants of 
step (b) for the desired levels of hydroxy lated fatty acids; 
and 

(d) processing the progeny seed of step (c) to obtain 
seed oil containing altered levels of unsaturated fatty acids. 

19. The method of Claim 18, wherein said crop plant is 
selected from the group consisting of rapeseed, Crambe, 
Brassica juncea, Canola, flax, sunflower, saf flower, cotton, 
cuphea, soybean, peanut, coconut, oil palm and corn. 

20. A triglyceride oil from a plant selected from the 
group consisting of rapeseed, Crambe, Brassica juncea, Canola, 
flax, sunflower, cotton, cuphea, soybean, peanut, coconut, oil 
palm and corn, wherein the fatty acid composition of the oil 
has been modified to contain hydroxy lated fatty acids by a 
method comprising growing a plant cell having integrated in 
its genome a DNA construct containing a plant hydroxylase 
encoding sequence of Claim 1, under conditions which will 
permit the transcription and translation of said plant 
hydroxylase in the plant cells. 
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21. A method to isolate nucleic acid fragments encoding 
fatty acid hydroxylases comprising: 

(a) comparing SEQ ID NO: 4 and other fatty acid 
hydroxylase sequences and fatty acid desaturases; 

(b) identifying conserved sequences of 4 or more amino 
acids obtained in step (a) ; 

(c) designing degenerate oligomers based on the conserved 
sequences identified in step (b) ; 

(d) using the degenerate oligomers of step (c) to isolate 
sequences encoding fatty acid hydroxylases by sequence 
dependent protocols; 

(e) obtaining the deduced amino acid sequence of the 
encoded gene product from the nucleotide sequence of the gene 
and; 

(f) distinguishing hydroxylase genes from desaturase 
genes by analyzing amino acid sequence differences between 
fatty acid desaturases and fatty acid hydroxylases. 
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Figure 1A 
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Figure 4A Mass spectrum of peak 10 from figure 3B 
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Mass spectrum of peak 11 from figure 3B 
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Mass spectrum of peak 12 from figure 3B 
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Mass spectrum of peak 13 from figure 3B 
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LFFAH12 I MGAGGRIM-- --VTPSSKKS - ■ ETE^LKRG PCEKPPFTVK OLKKi i POHC 50 

FAH12 1 MGGGGRMSTV ! TSNilSEKKG --GSSHURA PHTKPPFTLG DLKRAIPPHC SO 

ATFAD2 1 MGAGGRMP-- --VPTSSKKS - - ETDTTKRV PCtKPPFSVG DLKK-iPPHC SO 

BNFAD2 1 MGAGGRMO-- --VSPPSKKS --ETDNIKRV PCETPPFTVG ELKKAIPPHC 50 

GMFAD2-1 1 MGLA-KETTM GGRGRVAKVE VQGKKPLSRV PNTKPPFTVG QLKKAIPPHC 50 

GMFAD2-2 1 MGAGGR TQVPPANRKS - - EVDPLKRV FFEKPQFSLS OIKKAIPPHC 50 

ZMFAD2 1 MGAGGRMTEK £ REKOEOLAR ATGGAAMORS PVEKPPFTLG OIKKAIPPHC SO 

RCFAD2 1 SO 

60 70 80 90 100 

LFFAH12 51 FKRSIPRSFS YLLTDITLVS CFYYVATNYF SLLPOPLSTY LAWPLYWVCO 100 

FAH12 SI FERSFVRSfS YVAYOVCLSF LFYSIATHFF PYISSPLS-Y VAWLVYWLFO 100 

ATFAD2 SI FKRSIPRSFS YLISCIIIAS CFYYVATNYF SLLPOPLS-Y LAWPLYUACQ 100 

BNFAD2 51 FKRSIPRSFS HLIWOI 1 1 AS CFYYVATTYF PLLPNPLS-Y FAWPLYWACO 100 

GMFAD2-1 51 FORSLITSFS YVVYDLSFAF I f Y - I A TTY r HLLPOPFS-L IAWPIYWVLO 100 

GMFA02-2 51 FQRSVLRSFS YVVYCLHAF CLYYVATHYF HLLPGPLS ■ F RGMA I YWAVQ - 100 

ZMFA02 SI FERSV LKSFS YVVKOLVIAA ALLY FALA ! 1 PALPSPLR-Y AAWPLYW I AQ 100 

RCFAD2 51 100 

110 120 130 140 150 

LFFAH12 101 GCVLTGIWVI GHECGHHAFS OYGWVDOTVG FIFHSFLLVP YFSWKYSHRR ISO 

FAH12 101 GCILTGLWVI GHECGHHAFS EYQLAOO I VG LIVHSALLVP YFSWKYSHRR 150 

ATFAD2 101 GCVLTGIWVI AHECGHHAFS OYOWLDOTVG LIFHSFLLVP YFSWKYSHRR ISO 

BNFAD2 101 GCVLTGVWV1 AHECGHAAFS OYOWLDOTVG LIFHSFLLVP YFSWKYSHRR 150 

GMFAD2-1 101 GCLLTGVWVI AHECGHHAFS KYQWVDDVVG LTLHSTLLVP YF$WK!SHRR 150 

GMFA02-2 101 GCILTGVWVI AHECGHHAFS DYQLLOO I VG LILHSALLVP YFSWKYSHRR ISO 

ZMFAD2 101 G AFS OYSLLDDVV'G LVLHSSLMVP YFSWKYSHRR ISO 

RCFAD2 101 WVM AHOCGHHAFS OYOLLDDWG LILHSCLLVP YFSWKHSHRR ISO 
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LFFAH12 151 HHSNNGSLEK DEVFVPPKKA AVKWYVKYL- NNPIGRILVL TVOFUGWPL 200 

FAH12 1S1 HHSNIGSLER DEVFVPKSKS KISWY$KY$- NNPPGRVLTL AATLLLGWPL 200 

ATFA02 151 HHSNTGSLER OEVFVPKOKS AIKWYGKYL- NNPLGR IKML TVQFVLGWPL 200 

BNFAD2 151 HHSNTGSLER 0EVFVPR-R5 OTSSGT AST - STTFGRTVML T VO FT LGWPL " 200 

GMFAD2-1 151 HHSNTGSLDR DEVFVPKPKS KVAWFSKY L - NNPLGRAVSL LVUTIGWPN 200 

GMFAD2-2 151 HHSNTGSLER OEVFVPKOKS C I KWY SKY L - NNPPGRVLTL AVTLTLGWPL 200 
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RCFAD2 151 HHSNTGSLER DEVFVPKKKS SI RWYSKYL- NNPPGRIMTI AVT LSLGWPL ' 200 
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LFFAH12 201 YLAFNVSGRP YOG - FASHFF PHAPI FKDRE RLOIYISDAG ILAVCYGLYR 250 

FAH12 201 YLAFNVSGRP YDR * FACHYO PYGP I FSERE RLOI YIADLG I FATTFV LYQ 250 

ATFA02 201 YLAFNVSGRP YOG- FACHFF PNAP I YNORE RLQIYLSOAG I LAVCFGLYR 2S0 

BNFA02 201 YLAFNVSGRP YDGGFACHFH PNAPI YNDRE RLOIYISDAG I LAVC YGLLP 2S0 

GMFAD2 * 1 201 YLAFNVSGRP YDS-FASHYH PYAPIYSNRE RL L I YVSDV A LFSVTY SLYR 2S0 

GMFAD2-2 201 YLALNVSGRP YDR- FACHYD PYGP I YSDRE RLOIYISDAG VLAVVYGLFR 250 

ZMFAD2 201 YLATNASGRP YPR-FACHFD PYGPIYNORE RAOIFVSDAG VVAVAFGLYK 250 

RCFAD2 201 YLAFNVSGRP YOR- FACHYO PYGPIYNORE RIEIFISOAG V LAVTFGLYO 250 
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260 270 280 290 300 

LFFAH12 251 -AASOGLTi.M ICVYGVPLLI VNFFLVLVTF LOHTHPSlPH YDSTEWEWIR 300 

FAH12 251 iTMAKGLAWV MR I YGVPLL I VNCFLVMITY LQHTHPA I PR YGSSEWDWLR 300 

ATFAD2 251 AAAQGMASM ICLYGVPLLI VNAFLVL I TV LQH'THPSLPH YOSSEWOWLR 300 

BNFA02 251 v AAVQGVASM VCFLRVPLLI VNGFLVLITY LQHTHPSLPH YDSSEWOWLR 300 

GMFA02- 1 251 V AT LKGLVWL LCVYGVPLLI VNGFLVT I TV lOHTHFALPH YDSSEWDWLK 300 

GMFAD2-2 251 L AM A KG LAW V VCVYGVPLLV VNGFLV L I T F LOHTHPALPH YTSSEWOWLR 300 

ZMFAD2 251 LAAAFGVWWV VRVYAVPLLI VNAWLV L I T Y LOHTHPSLPH YOSSEWOWLR 300 

RCFAD2 251 1 A I A KG LAW V VCVYGVPLLV VNSFLV L I TF LQHTHPA LPH YOSSEWOWLR 300 

310 320 330- 340 350 

LFFAH12 301 GALVTVDRDY GI LNKVFHNI TOTHV AHHL F AT I PHYNAME ATEAIKPRG 350 

FAH12 301 GAMVTVOROY GVLNKVFHNI AOTHVAHHLF ATVPHYHAME ATKAIKPIMG 350 

ATFA02 301 GALATVOROY GI LNKVFHN I TDTHVAHHLF STMPHYNAMt ATKAl KPILG 350 

BNFAD2 301 GALATVDRDY GILNOGFHNI TDTHEAHHLF STMPHYHAME ATKAIKPILG 3S0 

GMFAD2-1 301 G ALATMORDY G I LNKVFHH I TDTHVAHHLF STMPHYHAME ATNAIKPILG 350 

GMFA02-2 301 GALATVOROY G I LNKVFHN I TDTHVAHHLF STMPHYHAME ATKAIKPILG 350 

2MFAD2 301 GALATMOROY GI LNRVFHN I TDTHVAHHLF STMPHYHAME ATKAIRPILG 350 

RCFA02 301 GALATVOROY GI LNKVFHNI TOTOVAHHLF 350 

360 370 380 390 400 

LFFAH12 351 DYYHFDGTPW YVAMYREAKE CLYVEPDTER GKKGVYYYNN K- L 400 

FAH12 351 EYYRYOGTPF YKALWREAKE C LFVEPDEGA PTOG VFWYRN KY- 400 

ATFAD2, 351 OYYQFDGTPW YVAMYREAKE CIYVEPOREG OKKGVYWYNN K- L 400 

BNFA02* 351 EYYQFDGTPV VKAMWREAKE CIYVEPOROG EKKGVFWYNN KL* 400 

GMFAD2-1 351 E Y YOFDDTPF YKALWREARE CLYVEPDEGT SEKGVYWY RN KY- 400 

GMFAD2-2 351 E Y YRFOETPF VKAMWREARE CIYVEPOOST ESKGVFWYNN KL - 400 

2MFAD2 .351 DYYHFDPTPV AKATWREAGE C I Y VEPE — DRKGVFWYNK KF* 400 
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Plasmid name: pSLJ44026 
Plasmid size: 25.70 kb 
Constructed by: Jonathon Jones 
Construction date: 19 92 

Comments/References: Transgenic Research 1.285-297 (1992) 
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